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A method for preventing or treating adverse conditions which may be 
reduced or abolished by modulating the activity of one or more 
phosphodiesterasas having the ability to cleave cGMP. 

SUMMARY OF THE INVENTION 

This application describes a method to identify novel chemical entities that can maintain 
or increase cGMP levels within living cells. Such compounds will specifically modulate 
the effectiveness of the cyclic GMP-cleaving phosphodiesterases, for example PDE5, 
found in smooth muscle cells of vascular tissue. The preferred mode of action being 
sought is dislocation of specific isoforms of these cyclic nucleotide phosphodiesterases 
(hereafter referred to as PDE;s) from their anchoring sites within cells, thereby reducing 
their specific effectiveness, not their enzymatic capacity. 

In its broadest aspect, the present application relates to a novel method for preventing or 
treating, in an animal in need thereof, an adverse condition which may be reduced or 
abolished by modulating the activity of one or more PDE;s having the ability to cleave 
cGMP. The method comprises modulation of the specific effectiveness of PDE or 
PDE:s by modulating their spatial distribution within cells of the animal. 
The PDE is chosen from the group consisting of PDE 1, PDE 2, PDE 5, PDE 6, and 
PDE 9. More specifically, the method relates to mPDE5 or cPDE5 and splice variants 
thereof. The animal with the adverse condition may be a mammal and preferably a 
human. 

In one embodiment of the invention modulation of the specific effectiveness of the PDE 
is a dislocation of the PDE from a native location within the cell. 

In another embodiment of the invention modulation of the specific effectiveness of the 
PDE involves a disruption of its targeting to a native location within the cell. 
In another embodiment of the invention modulation of the specific effectiveness of the 
PDE involves interference with the redistribution of the PDE, the redistribution being 
associated with an increase or a decrease of the specific effectiveness of the PDE. 
The modulation of the specific effectiveness of the PDE may involve both an up- 
regulation or a down-regulation of the effectiveness of the PDE to perform its function 
within the cell. 

This patent application is associated with the patent application "An improved 
method..." enclosed hereby as appendix A. Appendix A is considered part of the 
application. 
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Very many extracellular signals are transduced via intracellular systems employing the 
cyclic nucleotides cyclic adenosine monophosphate (cAMP) and cyclic guanosine 
monophosphate (cGMP) as intermediaries, or second messengers. The processes 
mediated by cAMP and cGMP include control of smooth muscle tone, learning, vision, 
cellular differentiation, control of pro-inflammatory mediator production and action, 
apoptosis, lipogenesis, glycogenolysis and gluconeogenesis, circadian rhythms, cardiac 
function, and mood control through noradrenergic potentiation. 

Cyclic nucleotides are generated by adenylate and guanylate cyclases (ACs and GCs, 
respectively) from ATP and GTP, signal to cAMP- and cGMP-dependent protein 
kinases (cAKs and cGKs, respectively) and directly to certain ion channels. cAMP and 
cGMP are removed by PDE;s. The required specificity of signals generated by these 
systems arises from diversity of type, tissue-specific expression and intracellular 
placement of the enzymes involved. For instance there are nine isoforms of ACs known 
plus additional splice variants, soluble and membrane located forms of GCs, multiple 
isoforms of the cAK and cGK kinases, and very many isoforms of PDEs of which over 
30 have been identified (Perry and Higgs, 1998; Houslay and Milligan, 1997; Beavo, 
1995). Additional specificity arises from targeting particular signalling enzymes to 
restricted locations within cells; this is the function of scaffold and anchoring proteins, 
such as the AKAP family, and not only may they place enzymes close to their 
substrates, but may also serve to recruit multiple enzymes into functional signalling 
units (Pawson and Scott, 1997). 

Two groups of GCs are known in mammals, the soluble ones and those that are 
membrane located. GCs from both groups are central to systemic control of blood 
pressure. Soluble GCs are expressed in almost all cell types of the cardiovascular 
system including cardiomyocytes, vascular smooth muscle cells (VSMCs), endothelial 
cells and platelets (Drewett and Garbers, 1994). Soluble GCs contain a prosthetic heme 
group which binds NO (and CO) and leads to activation of the enzyme: the vasoactive 
properties of NO are mediated through the cGMP pathway in this way. The membrane 
located GCs act as receptors for various ligands (among them, natriuretic peptides and 
guanylin). cGMP-mediated functions of the natriuretic hormone receptors include 
vascular smooth muscle relaxation as well as regulation of blood volume (Benner et ah, 
1990). 

cGMP interacts with a number of different effector proteins: 

a) with certain ion channels e.g. in photoreceptors and olfactory cells, 
also in heart and kidney (Lincoln & Cornwell, 1993; Biel et al y 1994; 
Light et aL, 1990); 

b) with cGMP-dependent protein kinases (cGKI and cGKJI), of which 
"cytosolic" cGKI predominates in the cardiovascular system and has at 
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least 2 splice variants, a and p. cGKla has 10-fold higher affinity for 

cGMP than the (3 variant. Both cGKI variants are found in vascular 

smooth muscle (Keilbach et al, 1992, Hofmann et al, 1992); 

c) at high concentrations, with cAMP-dependent protein kinases (cAK), 
which being similar to the cGKs have a certain affinity for cGMP, just as 
the reverse is also true (Vaandrager & de Jonge, 1996). The functional 
significance of this potential cross-talk between pathways is not yet fully 
known, but may be connected with the anti-proliferative effects of cGMP 
(Lincoln etal, 1994); 

d) with cGMP-modulated PDEs: cGMP binds to a non-catalytic site of 
PDE2 and lowers its K m for cAMP, lowering the baseline level of cAMP 
achievable by the enzyme. PDE3 catalysis of cAMP is effectively 
inhibited by cGMP (Pyne et al, 1987), thus in cells where PDE3 
predominates, increased cGMP leads to increased cAMP. 

Smooth muscle contracts following Ca 2+ -calmodulin activation of myosin light chain 
kinase (MLCK). cGKI relaxes smooth muscle by lowering free cytoplasmic Ca 2+ 
levels, but the principal means by which this is accomplished varies considerably 
between types of smooth muscle, animal species, and the nature of the contractile 
stimulus being antagonised (Vaandrager & de Jonge, 1996). cGKI has been implicated 

in: inhibition of G-protein activation of phospholipase C (3; activation of Ca 2+ -ATPase 

activity at plasma membrane and sarcoplasmic reticulum (SR); hyperpolarisation of 
membrane potential through activation of Ca 2+ -activated K + channels; inhibition of 
voltage operated Ca 2+ channels; stimulation of the Na7Ca 2+ exchanger; inhibition of SR 
EP 3 receptors. All of these actions require that the normally cytoplasmic cGKs must find 
membrane located targets, and specific anchor proteins may be involved. cGKI is 
already known to be targeted to specific anchor proteins of the cytoskeleton (MacMillan- 
Crow & Lincoln, 1994), and the discovery of further interactions is likely. 
Inactivation of cAMP/cGMP occurs by hydrolysis of the 3'-ester bond, catalysed by the 
PDEs. The major cGMP-degrading PDEs are types 1,2,5, 6 and 9 but here we focus on 
PDE5, since this is the principal cGMP- specific PDE found in airway and vascular 
smooth muscle, and it is one of the better documented families of cGMP-specific PDEs. 
Little is known yet concerning the role of the newly discovered PDE9 isoforms 
(Soderling et al, 1998; Fisher et al, 1998), and the situation is similar for PDE2s, 
since good inhibitors are as yet unknown for these (Perry and Higgs, 1998). PDES is 
activated by cAK and (10-fold faster) by cGK (Thomas et al, 1990). Phosphorylation 
of PDE5 is enhanced in the presence of cGMP, and apparently increases the enzyme's 
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V niax by 10-fold (Burns et a/., 1992). Coupled with PDE3, these interactions form a 
feedback system to limit cGMP signaling: increased cGMP will increase cAMP through 
inhibition of PDE3, high cAMP will activate cAK which, in the presence of elevated 
cGMP will activate PDE5 and therefore stimulate cGMP breakdown. cAMP levels 
return to baseline as cGMP falls, by re-activation of PDE3. Recent evidence (Pyne et 
al. y 1996; Lochhead et ai, 1997) suggests that PDE5 may have additional protein 

components associated with it analagous to the gamma subunits of PDE6. The PDE6y 

subunits serve to link activation of the G-protein transducin to activation of the PDE. 
They are subsequently involved in turning off the signal by helping to activate the 
transducin GTPase. In the case of PDE5, these associated proteins (14 to 18 kDa) may 
serve to block activation of the enzyme by cGK and cAK, and the blocking ability of 
these polypeptides appears to be controlled by a G-protein regulated kinase (Pyne et al., 
1996). 

The PDEs are key components of the cyclic nucleotide signalling systems, allowing 
local concentration differences of the cyclic nucleotide messengers to be established, 
between adjacent tissues, between adjacent cells, even within a single cell between 
different volumes of cytoplasm. The ability to generate such heterogeneity in the 
distribution of concentrations of a commonly shared signalling molecule, such as 
cAMP, is at the heart of directed signalling processes. To be of therapeutic value, cyclic 
nucleotide control has to be achieved with defined cellular selectivity (Perry and Higgs, 
1998). It is the therapeutic opportunities offered by certain of the PDEs that concerns 
this application. 

Ten families of PDEs have been identified, designated simply PDE1 to PDE10. Within 
each family there are two or more related but distinct gene products ( A, B, C, etc.) and 
for each of these alternative mRNA processing gives rise to multiple splice variants, 
identified by an additional arabic numeral in accordance with the most recent 
nomenclature recommendation (Molecular Pharmacology 46:399-405, 1994). All PDE 
gene products identified so far have two functional domains per molecule, one catalytic, 
and one regulatory. The catalytic domain lies towards the carboxylic acid terminus of 
each PDE protein and has the greatest homology between the PDE families, being 
>75% homologous at the amino acid level (Perry and Higgs, 1998). Nevertheless, each 
of the more than 30 PDEs known have individually distinct substrate specificities, 
kinetic characteristics, regulatory properties and cellular and subcellular distributions 
(Houslay andMilligan, 1997). 

PDEs 4, 7 and 8 are highly specific for cAMP. PDEs 5, 6, and 9 are selective for 
cGMP. PDE3s bind cAMP and cGMP with similar affinity, but hydrolyse cAMP most 
efficiently, cGMP rather poorly. PDE3s are therefore negatively regulated in their 
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cAMP hydrolysing ability by cGMP. PDEs 1 and 2 hydrolyse both cAMP and cGMP, 
but with PDE1 the relative efficiencies vary with isoenzyme subtype (Perry and Higgs, 
1998). 

The amino terminal ends of PDEs consist of the regulatory domains, which are very 
different both between families and between variants within families. This region 
contains variously: a binding domain for Ca 2+ -calmodulin (CaM) in PDE1; non- 
catalytic cGMP-binding sites in PDEs 2, 5 and 6; a binding domain for the signalling G- 
protein transducin in PDE6. The amino terminal region also contains membrane- 
targeting sequences in several PDE3s and PDE4s, as well as protein kinase 
phosphorylation sites in PDEs 1, 3, 4 and 5. These phosphorylation sites are likely to 
be important in regulation of catalytic activity and/or subcellular location (Perry and 
Higgs, 1998). 

This application concerns the particular therapeutic benefits arising from selective 
inhibition of PDE isoforms and variants which hydrolyse cGMP. 

Blood pressure elevation to a degree that requires medical treatment is often encountered 
in up to 15% of an adult population. In only 10-15% of these, a definite cause for the 
hypertension can be found and in the rest, the "essential hypertension" has to be treated 
without a hope for cure of the underlying disease. Long-standing elevation of blood 
pressure, even quite moderate, damages vessels in the heart, kidneys and brain and 
dramatically increases the risk for coronary heart disease, renal failure and stroke. It has 
been shown that effective pharmacologic treatment of hypertension substantially reduces 
morbidity and mortality from these conditions. The finding that endothelial cells 
produce a local vascular relaxation factor, identified as nitric oxide (NO), that activates 
guanylyl cyclase and increases cGMP that in turn leads to reduction in vascular smooth 
muscle cell tone, has opened new possibilities for blood pressure regulation / 
vasorelaxation based on modulation of the cellular levels of cGMP. A number of the 
components in the cGMP system displays tissue specific distribution (Vaandrager & de 
Jonge, 1996; Pyne et ai, 1996). This increases the likelihood for improved 
pharmacological specificity and fewer side-effects when using these as targets for 
antihypertensive treatment instead of the traditional ones. It is the cGMP-dependent 
protein kinase (PKG) (Vaandrager & de Jonge, 1996) that is thought to mediate the 
intracellular effects of cGMP. The cGMP -dependent and -specific phosphodiesterases 
can serve as connectors to the cAMP system and terminators of cGMP effects (Pyne et 
al, 1996). PDE5 has attracted attention since it is selective for degradation of cGMP 
versus cAMP. Isoform-specific inhibitors for PDE5 are being developed by several 
companies and one compound from Pfizer, Sildenafil, has proven selectivity for PDES 
and is currently being marketed as treatment against impotence (Viagra), originally a 
side-effect resulting from vasorelaxation in the corpus cavernosum. However the 
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screening procedures currently used search only for direct enzymatic inhibitors of PDE 
and the compounds found are often not selective, inhibiting for instance both PDE 1 and 
5 (e.g. Zaprinast (M&B 22948 RPR), Sch 59498 and Sch 51866). By the methods 
described herein and within appendix A, new chemical entities can be found which 
primarily will be specific modulators of PDEaction, not inhibitors of the enzymatic 
action per se. Preferred compounds will inhibit the site-specific anchoring of PDEs 
which hydrolyse cGMP, and thereby reduce their effectiveness in controlling local 
concnetrations cGMP within living cells. 

The therapeutic potential of selective modulators of cGMP-related PDE action is not 
restricted to relaxation of smooth muscle cells but also encompasses other effects 
ascribed to PKG, such as inhibition of platelet activation (Chiu et aL, 1997: Vemulapalli 
et aL, 1996), inhibition of endothelial permeability increases in response to vasoactive 
substances (Raeburn & Karlsson, 1993), inhibition of the differentiation of osteoclasts 
(Holliday et aL, 1997) and light-induced resetting of circadian rythms (Mathur et aL, 
1996; Liu et aL, 1997). 

The search for chemical inhibitors of the catalytic activity of specific PDEs is currently 
one of the most intensive areas of pharmaceutical research, particularly so for PDEs 4 
and 5. Much progress has been made in this area, with several compounds known to 
have selective activity for particular families of PDEs (reviewed in Perry and Higgs, 
1998; Hughes et aL, 1997; Teixeira et aL, 1997). However, there has not yet been 
found a class of compounds able to select between isoenzymes within the same family, 
which is where the greatest opportunities lie. Without isoform specificity, certain 
difficulties can be expected with the use of enzymic inhibitors of PDEs. Some of these 
difficulties are outlined below. 

In general, the effects a known inhibitor of the catalytic activity of a particular class of 
PDEs may have on cyclic nucleotide levels often varies between different cell types. The 
reasons for this are several, but include: differences in the basal level of cyclase activity 
in distinct cell types, crosstalk between cAMP and cGMP systems, and differences in 
local concentrations of substrate within a cell which influences the degree of inhibition 
that can be attained by a simple competitive enzyme inhibitor (Perry and Higgs, 1998). 
First, PDE inhibition is only useful if it produces the appropriate change in the activity 
of the dependent effectors, for instance activation of cAK when the concentration of 
cAMP can be increased above a threshold level. The rate of change in concentration 
depends in part on the activity of the cyclases which generate the cyclic nucleotides, and 
that basal level of activity differs from isoform to isoform, and therefore from cell type 
to cell type. In adipocytes, for example, AC activity is high and cAMP levels are kept at 
baseline only by a correspondingly high PDE activity. Hepatocytes on the other hand 
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have a rather low AC activity. If both cell types share PDEs of the same family, and are 
treated with a chemical inhibitor targeting that family, there will be a rapid increase in 
cAMP within adipocytes and activation of their cAKs, but no activation in hepatocytes, 
unless the AC is also stimulated. 

Second, general inhibition of a particular isoform of PDE can have certain unavoidable 
consequences on other cyclic nucleotide pathways since cAMP and cGMP systems are 
often closely interlinked. Much of this crosstalk arises from PDE regulation by cyclic 
nucleotides. When cGMP increases in platelets (e.g. following nitric oxide stimulation 
of soluble GC, or PDES inhibition) it inhibits PDE3 and causes a concomitant rise in 
cAMP (Ashida and Sakuma, 1992). In adrenal glomerulosa cells, atrial natriuretic factor 
elevates cGMP but inhibits c AMP- stimulated aldosterone synthesis via cGMP- 
stimulation of PDE2 (MacFarland et al, 1991). 

Third, the expected effects of PDE inhibition may be modified by differences in local 
concentrations of substrates, the reason being that most chemical inhibitors of PDE 
action are competitive with substrate, so their therapeutic profile is dependent on both 
the Michaelis-Menton equilibrium constant (K M ) and the substrate concentration in 
which they are operating (Perry and Higgs, 1998). Most effective inhibition will always 
occur at lowest substrate levels, but as a corollary, a locally increased substrate level 
will reduce the inhibition attained. In combination with subtle differences in isoform K M 
values for an inhibitor, the desired spatial modulation of cyclic nucleotide levels within a 
cell could be difficult to obtain by simple competitive inhibition of catalytic activity. 

The radically different methods of interference with PDE action as proposed below in 
this application should avoid many of the problems outlined above, principally because 
interference will be family and isoform specific and targeted not against catalytic activity 
of the PDEs, but their spatial organisation within the cell. 

Targeting of signalling enzymes is a recognised mechanism by which sensitivity, 
specificity, precision and control may be introduced into intracellular signalling 
pathways (Pawson and Scott, 1997; Faux and Scott, 1996). The importance and 
occurrence of targeting as a phenomenon are described and discussed in appendix A. Of 
central importance to this application is the modulation of the effectiveness of signalling 
PDEs through interference with their intracellular targeting. As already described, the 
many PDEs known share much structural homology, and this is especially true within 
the catalytic regions found towards the carboxylic acid terminals of the proteins. At the 
amino terminals much more heterogeneity is found, between families of PDEs, between 
isoforms within families, and between splice variants derived from individual gene 
isoforms (Houslay and Milligan, 1997). Much of this heterogeneity appears to be 
associated with differences in targeting behaviour, at least in PDE4 isoforms and 
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variants (Scotland*?/ aL, 1998, Bolger et aL, 1997), and by extension should apply to 
other PDEs as well since they are in overall character similar protein molecules with 
similar roles in cellular signalling. 

Evidence suggests that the amino terminal regions of PDEs can serve to target isoforms 
to specific intracellular sites (Shakur et aL, 1995; McPhee et aL, 1995; Bolger et aL, 
1996; Pooley et aL, 1997) and that they can regulate the functioning of the catalytic unit 
either through interaction with binding proteins (Shakur et aL, 1995; O'Connell et aL, 
1996; Pyne et aL, 1996) or through phosphorylation (Sette and Conti, 1996). Targeting 
appears to occur through protein-protein interactions with membrane- or cytoskeletally- 
located proteins (Houslay, Sullivan and Bolger, 1998), and of these the membrane 
associated proteins include both integral and peripherally adherent species. Such 
interactions have been probed at a gross level through the use of nonionic detergents 
and elevated ionic strength (Scotland et aL, 1998). 

So far only two PDE5 genes are known and two enzyme variants have been reported. 
In parallel with other PDE isoforms more splicing variants are to be expected from each 
gene. The enzyme is a homodimer, each subunit being 93 kDa. The structural 
organisation of the dimer is very similar to that of the cGKs. 

PDE5s exist in two distinct forms: one membrane-bound (mPDE5) and one cytosolic 
(cPDE5) (Pyne et aL, 1996). The mPDE5 is activated by PKA and is inhibited by a G- 
protein dependent mechanism. It is assumed that cPDE5 is part of a "signalling cassette' 1 
with NO-regulated guanylate cyclase and PDE3. The latter construction will lead to very 
short-lived messages whereas the former allows for generation of prologed cGMP 
signals 

Targeting or anchoring of PDE5s is likely to have its greatest effect through 
compartmentalisation of cGMP signalling within cells. Associated with the PDE5s will 
be specific GCs together with specific isoforms of the effector cGK, or cGMP-operated 
ion channels. cGKs may be attached to specific G-kinase anchoring proteins. Specific 
subcellular distributions of these components will allow for spatial and temporal 
gradients of cGMP to be established within cellular compartments. Targeted PDES 
species might serve to control threshold levels of cGMP in the environs of specific cGK 
molecules, perhaps protecting certain protein complexes from cGK-mediated 
phosphorylation or manipulating the activity levels of GCs that are necessary before 
cGK activation may occur. 

Competitive chemical inhibitors are known which can selectively inhibit PDE5s. 
Relatively few isoforms of PDE5 are known to date. PDE5 is found rather specifically 
in vascular and airway smooth muscle. That sildenafil, with its 5 nM IC 50 for PDES, 
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affects only a subset of vascular smooth muscle is puzzling, but strongly suggests that 
either multiple PDE5 isoforms or states exist in different vascular smooth muscle, 
presumably with different sensitivities to sildenafil, or more likely, other cGMP- 
hydrolysing PDEs are important in different vascular smooth muscles. 
As to other potentially important cGMP-hydrolysing PDE targets, many are doubtless 
yet to be discovered. PDE9s have only been known since the end of 1997. These have a 
rather general distribution (kidney, brain, lung), have a very high affinity for cGMP 

(about 70 nM) and are inhibitable by the PDE1/5 inhibitor SCH51866 (1.55 jiM), but 

"not by sildenafil" (7 [iM 7 Soderling et a/., 1998). Their physiological roles and 

regulation have not been defined (Soderling et al, 1998; Fisher et al, 1998), but the 
best suggestions are that they may be involved in keeping cGMP at very low levels 
when activated, and may, in kidney, be involved in termination of ANP signalling, and 
therefore inhibition may help potentiate natriuresis without causing deleterious drops in 
blood pressure (Soderling et al, 1998). 

It is clear that PDEs possess heterogenity in their amino terminal regions, and the 
approach outlined in this application exploits those differences between isoforms and 
splice variants to produce what should be confined and defined therapeutic effects. 
Furthermore, in many cases it may be expected that dislocation of an active enzyme 
from a targeted site of action will have little effect on average cellular concentrations of 
cGMP, although significant increases may occur at the now PDE-free site of action. 
This may have significance where an acute short-term process is the therapeutic target, 
but an integrative gene-regulation effect may occur upon general, non-specific PDE 
inhibition and overall cGMP increase in the cell. 

DETAILED DISCLOSURE 

In the present specification and claims, the term "influence" covers any influence to 
which the cellular response comprises a redistribution. Thus, e.g., heating, cooling, 
high pressure, low pressure, humidifying, or drying are influences on the cellular 
response on which the resulting redistribution can be quantified, but perhaps the most 
important influence is the influence of contacting or incubating the cell or cells with a 
substance which is known or suspected to cause a redistribution. In another 
embodiment of the invention the influence could be substances from a compound drug 
library. 

In the present context, the term "green fluorescent protein" (GFP) is intended to indicate 
a protein which, when expressed by a cell, emits fluorescence upon exposure to light of 
the correct excitation wavelength (cf. Chalfie, M. et al (1994) Science 263, 802-805). 
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In the following, GFP in which one or more amino acids have been substituted, 
inserted or deleted is also termed "modified GFP 1 '. "GFP" as used herein includes wild- 
type GFP derived from the jelly fish Aequorea victoria and modifications of GFP, such 
as the blue fluorescent variant of GFP disclosed by Heim et al. ( Heim, R. et al. (1991). 
Proc. Natl. Acad. Sri. 91:26, pp 12501-12504), and other modifications that change the 
spectral properties of the GFP fluorescence, or modifications that exhibit increased 
fluorescence when expressed in cells at a temperature above about 30°C described in 
PCT/DK96/00051, published as WO 97/11094 on 27 March 1997 and hereby 
incorporated by reference, and which comprises a fluorescent protein derived from 
Aequorea Green Fluorescent Protein or any functional anatague thereof, wherein the 
amino acid in position 1 upstream from the chromophore has been mutated to provide an 
increase of fluorescence intensity when the fluorescent protein of the invention is 
expressed in cells. Preferred GFP variants areF64I^GFP, F64LY66H-GFP and F64Lr 
S65T-GFP. An especially preferred variant of GFP for use in all the aspects of this 
invention is EGFP (DNA encoding EGFP which is a F64L-S65T variant with codons 
optimized for expression in mammalian cells is available from Clontech, Palo Alto, 
plasmids containing the EGFP DNA sequence, cf. GenBank Acc. Nos. U55762, 
U55763). 

The terms "intracellular signalling pathway" and "signal transduction pathway" are 
intended to indicate the coordinated intracellular processes whereby a living cell 
transduces an external or internal signal into cellular responses. Said signal transduction 
will involve an enzymatic reaction said enzymes include but are not limited to protein 
kinases, GTPases, ATPases, protein phosphatases, phospholipases and cyclic 
nucleotide phosphodiesterases. The cellular responses include but are not limited to 
gene transcription, secretion, proliferation, mechanical activity, metabolic activity, cell 
death. 

The term "second messenger" is used to indicate a low molecular weight component 
involved in the early events of intracellular signal transduction pathways. 

The term "luminophore" is used to indicate a chemical substance which has the property 
of emitting light either inherently or upon stimulation with chemical or physical means. 
This includes but is not limited to fluorescence, bioluminescence, phosphorescence, 
chemiluminescence. 

The term "mechanically intact living cell" is used to indicate a cell which is considered 
living according to standard criteria for that particular type of cell such as maintenance 
of normal membrane potential, energy metabolism, proliferative capability, and has not 
experienced any physically invasive treatment designed to introduce external substances 
into the cell such as microinjection. 



22132DK1 



11 



In the present context, the term "permeabilised living cell" is used to indicate cells where 

a pore forming agent such as Streptolysin O or Staphylococcus Aureus a-toxin has 

been applied and thereby incorporated into the plasma membrane in the cells. This 
creates proteinaceous pores with a defined pore size in the plasma membranes of the 
exposed cells. Pores could also be made by electroporation, i.e. exposing the cells to 
high voltage discharges, a procedure that creates small holes in the plasma membrane by 
coagulating integral membrane proteins. Treatment with a mild detergent such as 
saponin may accomplish the same thing. Common to all these treatments is that pores 
are formed only in the plasma membrane without affecting the integrity of cytoplasmic 
structural elements and organelles. The term living in this context means that the 
permeabilised cell or cells bathed in a solution mimicking the intracellular milieu still 
have functional organelles, such as actively respiring mitochondria and endoplasmatic 
reticulum that can take up and release calcium ions, and functional structural elements. 
In one embodiment this method is applied so that substances that normally can not 
traverse the plasma membrane, but most likely exert their influence intracellular^, can 
be introduced and their influence studied. In another embodiment this method is used to 
record the response to an influence from many cells simultaneously. 

In the present context, the term "permeabilisation" is intended to indicate the selective 
disruption of the plasma membrane barrier so that soluble substances freely mobile in 
the cytosol may be lost from the interior of the cells. The permeabilisation can be 
achieved as described above under "permeabilised living cells" or by using other 
chemical detergents such as Triton X-100 or digitonin in carefully titrated amounts. 

The term "physiologically relevant", when applied to an experimentally determined 
redistribution of an intracellular component, as measured by a change in the 
luminescence properties or distribution, is used to indicate that said redistribution can be 
explained in terms of the underlying biological phenomenon which gives rise to the 
redistribution. 

The terms "image processing" and "image analysis" are used to describe a large family 
of digital data analysis techniques or combination of such techniques which reduce 
ordered arrays of numbers (images) to quantitative information describing those ordered 
arrays of numbers. When said ordered arrays of numbers represent measured values 
from a physical process, the quantitative information derived is therefore a measure of 
the physical process. 

The term "mammalian cell" is intended to indicate any living cell of mammalian origin. 
The cell may be an established cell line, many of which are available from The American 
Type Culture Collection (ATCC, Virginia, USA) or a primary cell with a limited life 
span derived from a mammalian tissue, including tissues derived from a transgenic 
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animal, or a newly established immortal cell line derived from a mammalian tissue 
including transgenic tissues, or a hybrid cell or cell line derived by fusing different 
celltypes of mammalian origin e.g. hybridoma cell lines. The cells may optionally 
express one or more non-native gene products, e.g. receptors, enzymes, enzyme 
substrates, prior to or in addition to the fluorescent probe. Preferred cell lines include 
but are not limited to those of fibroblast origin, e.g. BHK, CHO, BALB, or of 
endothelial origin, e.g. HUVEC, BAE (bovine artery endothelial), CPAE (cow 
pulmonary artery endothelial), HLMVEC (human lung microvascular endothelial cells) 
or of pancreatic origin, e.g. RIN, INS-1, MIN6, bTC3, aTC6, bTC6, HIT, or of 
hematopoietic origin, e.g. primary isolated human monocytes, macrophages, 
neutrophils, basophils, eosinophils and lyphocyte populations, AML-193, HL-60, 
RBL-1, adipocyte origin, e.g. 3T3-L1, neuronal/neuroendocrine origin, e.g. AtT20, 
PC12, GH3, muscle origin, e.g. SKMC, A10, C2C12, renal origin, e.g. HEK 293, 
LLC-PK1. 

The term "hybrid polypeptide" is intended to indicate a polypeptide which is a fusion of 
at least a portion of each of two proteins, in this case at least a portion of the green 
fluorescent protein, and at least a portion of a catalytic and/or regulatory domain of a 
protein kinase. Furthermore a hybrid polypeptide is intended to indicate a fusion 
polypeptide comprising a GFP or at least a portion of the green fluorescent protein that 
contains a functional fluorophore, and at least a portion of a biologically active 
polypeptide as defined herein provided that said fusion is not the Glucocorticoid 
Receptor-GFP disclosed by Carey, KL et al. and Guiliano, KA et al., respectively. 
Thus, GFP may be N- or C-terminally tagged to a biologically active polypeptide, 
optionally via a linker portion or linker peptide consisting of a sequence of one or more 
amino acids. The hybrid polypeptide or fusion polypeptide may act as a fluorescent 
probe in mechanically intact or permeabilised living cells carrying a DNA sequence 
encoding the hybrid polypeptide under conditions permitting expression of said hybrid 
polypeptide. The term hybrid polypeptide or fusion polypeptide is intended also to 
include the term "fluorescent probe", where the latter is used to indicate a fluorescent 
fusion polypeptide comprising a GFP or any functional part thereof which is N- or C- 
terminally fused to a biologically active polypeptide as defined herein, optionally via a 
peptide linker consisting of one or more amino acid residues, where the size of the 
linker peptide in itself is not critical as long as the desired functionality of the fluorescent 
probe is maintained. A fluorescent probe according to the invention is expressed in a 
cell and basically mimics the physiological behaviour of the biologically active 
polypeptide moiety of the fusion polypeptide. 

The term "kinase" is intended to indicate an enzyme that is capable of phosphorylating a 
cellular component. 
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The term "protein kinase" is intended to indicate an enzyme that is capable of 
phosphorylating senne and/or threonine and/or tyrosine in peptides and/or proteins. 

The term "phosphatase" is intended to indicate an enzyme that is capable of 
dephosphorylating phosphoserine and/or phosphothreonine and/or phosphotyrosine in 
peptides and/or proteins. 

The term "cyclic nucleotide phosphodiesterase" is intended to indicate an enzyme that is 
capable of inactivating the second messengers cAMP and cGMP by hydrolysis of their 
3' -ester bond. 

In the present context, the term "biologically active polypeptide" is intended to indicate a 
polypeptide affecting intracellular processes upon activation, such as an enzyme which 
is active in intracellular processes or a portion thereof comprising a desired amino acid 
sequence which has a biological function or exerts a biological effect in a cellular 
system. In the polypeptide one or several amino acids may have been deleted, inserted 
and/or replaced to alter its biological function, e.g. by rendering a catalytic site inactive 
or by disrupting the targeting sequence. In another embodiment, one or several amino 
acids may have been deleted, inserted and/or replaced without altering the biological 
function of the polypeptide, that is, it remains biologically equivalent. Preferably, the 
biologically active polypeptide is selected from the group consisting of proteins taking 
part in an intracellular signalling pathway, such as enzymes involved in the intracellular 
phosphorylation and dephosphorylation processes including kinases, protein kinases 
and phosphorylases as defined herein, but also proteins making up the cytoskeleton 
play important roles in intracellular signal transduction and are therefore included in the 
meaning of "biologically active polypeptide" herein. More preferably, the biologically 
active polypeptide is a protein which according to its state as activated or non-activated 
changes localisation within the cell, preferably as an intermediary component in a signal 
transduction pathway. Included in this preferred group of biologically active 
polypeptides are cAMP dependent protein kinase A and cyclic nucleotide 
phosphodiesterases. 

The term "a substance" is intended to indicate any sample which has a biological 
function or exerts a biological effect in a cellular system. The sample may be a sample 
of a biological material such as a sample of a body fluid including blood, plasma, 
saliva, milk, urine, or a microbial or plant extract, an environmental sample containing 
pollutants including heavy metals or toxins, or it may be a sample containing a 
compound or mixture of compounds prepared by organic synthesis or genetic 
techniques. 

The phrase "any change in fluorescence" means any change in absorption properties, 
such as wavelength and intensity, or any change in spectral properties of the emitted 



22132DK1 



14 



light, such as a change of wavelength, fluorescence lifetime, intensity or polarisation, or 
any change in the intracellular localisation of the fluorophore. It may thus be localised to 
a specific cellular component (e.g. organelle, membrane, cytoskeleton, molecular 
structure) or it may be evenly distributed throughout the cell or parts of the cell. 

The term "organism" as used herein indicates any unicellular or multicellular organism 
preferably originating from the animal kingdom including protozoans, but also 
organisms that are members of the plant kingdoms, such as algae, fungi, bryophytes, 
and vascular plants are included in this definition. 

The term "nucleic acid" is intended to indicate any type of poly- or oligonucleic acid 
sequence, such as a DNA sequence, a cDNA sequence, or an RNA sequence. 

The term "biologically equivalent" as it relates to proteins is intended to mean that a first 
protein is equivalent to a second protein if the cellular functions of the two proteins may 
substitute for each other, e.g. if the two proteins are closely related isoforms encoded 
by different genes, if they are splicing variants, or allelic variants derived from the same 
gene, if they perform identical cellular functions in different cell types, or in different 
species. The term "biologically equivalent" as it relates to DNA is intended to mean that 
a first DNA sequence encoding a polypeptide is equivalent to a second DNA sequence 
encoding a polypeptide if the functional proteins encoded by the two genes are 
biologically equivalent. 

The term "fixed cells" is used to mean cells treated with a cytological fixative such as 
glutaraldehyde or formaldehyde, treatments which serve to chemically cross-link and 
stabilize soluble and insoluble proteins within the structure of the cell. Once in this state, 
such proteins cannot be lost from the structure of the now-dead cell. 
In the present context a "quantitative fluorescence redistribution assay" is intended to 
indicate an assay whereby it is possible to observe and quantify the subcelluar 
localisation and possible redistribution of an biologically active polypeptide, or part 
thereof, genetically or chemically tagged with a luminophore inside an intact living cell 
or cells or permeabilised living cells. The subcelluar location and redistribution may be 
monitored using fluorescence microscopy or fluorescence imaging microscopy but is 
preferably monitored using a fluorescence imaging plate reader or a fluorescence plate 
reader for improved throughput. A more thorough description is given in Appendix A. 
In the present context a "mortal cell line" is used to indicate animal cells that may grow 
in vitro, given the right conditions, but that have a definite life span of a number of cell 
divisions or days, week or months beyond which it is not at present possible to keep 
them alive. 

In the present context an " immortalised cell line" is used to indicate cells of animal 
origin where the normal limitations for cell life and number of cell divisions do not 
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apply. Essentially, such cells can live, grow and divide for an unlimited or very long 
(years to decades) time. 

The term "targeting sequence' 1 is used to indicate the amino-acid sequence of a 
biologically active polypeptide that contains the actual structure or structures necessary 
for association of the biologically active polypeptide with its native intracellular binding 
sites. The term "targeting sequence" is also used to indicate the amino-acid sequence of 
a protein that contains the actual structure or structures necessary for association of a 
biologically active polypeptide with the protein. 

The term "targeting" is used to indicate the process whereby a spatially distributed 
protein is directed to the intracellular sites and maintained at the intracellular sites to 
which it is normally anchored or associated. These anchoring sites are normally 
assumed to be the intracellular sites where the protein has its optimal function for the 
cell. 

The term "dislocate" and derivatives thereof is used to indicate the process whereby an 
intracellularly spatially distributed protein is forced to detach from its normal anchoring 
or association structures in the cells due to intercalation of another, preferably smaller, 
compound at the site of anchoring or association. This usually means that the optimal 
function of the protein within the cell is lost or reduced and that a larger portion of the 
protein molecules are freely mobile within the cytoplasm. 

In the present context a "screening assay" is intended to mean any measurement 
protocol, including materials, cells, instruments, chemicals, reagents, detection units, 
calibration and quantification procedures used to measure a response from mechanically 
intact or permeabilised living cells relevant to influences on an intracellular pathway. 
In the present context a "primary screening assay" is used to indicate the first screening 
assay in a discovery project that is used to select and sort all compounds available to the 
project according to the quantified effect of the compounds in the assay. 
In the present context a "counterscreen" is intended to mean a screening assay that is 
relevant to a phenomenon that is undesirable seen from the point of view of the 
discovery project. 

In the present context a "discovery project" is intended to mean the process whereby 
general or specific ideas about ways of how to modulate an intracellular signalling 
pathway are exploited in order to find new chemical compounds that can be used to 
modulate the intracellular signalling pathway and thereby treat, reduce or abolish 
symptoms associated with a condition or a disease that is lethal, degenerative, 
performance-reducing or just uncomfortable to an animal, preferably a human being. 
The aim of the discovery project is to produce drug candidates that can be tested as 
potential drugs in an animal, preferably in human beings. The term "discovery project" 
also encompasses the actual group of individuals, screening assays, tests, machinery, 
cells, animals and compounds involved in different aspects of the project. 
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The term "tagging" is used to indicate the process whereby a luminophore is genetically 
or chemically attached to the protein, or part of the protein, of interest to the discovery 
project. 

The term "primary hit" is used to indicate compounds identified in the primary screening 
assay as having at least the minimum level of desired effect that has been specified in the 
discovery project. 

The term "primary lead compound" is used to indicate a primary hit that has at least the 
minimal level of desired potency and specificity predetermined by the discovery project. 
The term "dose-response relationship" is in the present context intended to mean a clear 
correlation between the quantified response of cells in a screening assay to application 
of an influence, such as a compound, and the concentration of the applied influence. 
The response to the influence may be both an up-regulation and a down-regulation of 
the quantitated parameter used in the screening assay. 

In the present context, the term "potency" is intended to mean the ability of an influence 
to affect the process under study. The process under study may be, for example a 
screening assay or a specific physiological or patophysiological response in an animal. 
In the present context, the term "selectivity" is intended to mean the difference in 
potency on the desired process, such as a screening assay, and an undesired process, 
such as a counterscreen, with the view of the discovery project. An influence or a 
compound is said to display selectivity if the potency for the desired process is higher 
than for the undesired process. 

In the present context, the term "structure-activity relationship" or "SAR" is intended to 
mean the situation where a direct relationship exists between a compound and 
modifications made to the compound and the activity of the compound and the 
modifications made to the compound in one or more screening assays. The process of 
building a SAR may be used to direct the chemical construction of new compounds with 
higher potency and selecivity than the original compound. 

The term "drug candidate lead" is used to indicate compounds that may be pursued by a 
discovery project as potential candidates for the final outcome of the project. 
In the present context, the term "efficacy" is intended to mean the ability of a compound 
to affect the process or condition under study. It is closely related to the term "potency" 
but is in the present context used when relating to effects of a compound on more 
complex screening assays than the primary screening assay or counterscreens and when 
relating to effects of a compound in animals. 

In the present context, the term "toxicity" is intended to mean that a compound in some 
way is toxic to cells, tissues or animals. The toxicity means that the cells, tissues or 
animals will in some way be harmed if the compound is applied at a sufficient 
concentration. The effects may ultimately lead to cell, tissue or animal death or a limited 
life compared to the normal condition. 
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In the present context, the term "physiology" is intended to mean the normal function of 
biological and biochemical processes inside cells, between cells and in the whole 
organism or animal. 

In the present context, the term "pathophysiology" is intended to mean deviations from 
the normal function of biological and biochemical processes inside cells, between cells 
and in the whole organism or animal that may be part of a condition or disease. 
In the present context, the term "patogenesis" is intended to mean the process, be it 
genetical, biological, biochemical, chemical or environmental, that ultimately may 
explain, at least in part, the apparent patophysiology associated with a condition or 
disease in an animal. 

In the present context, the term "fractionated cells" is intended to mean the outcome of a 
simple division of initially mechanically intact living cells into two fractions, particulate 
(the components that can be sedimented by centrifugation at more than 10 OOOxg and 
not more than 100 OOOxg for 10 minutes) and soluble fraction (the soluble components 
and small membrane fragments that do not sediment), after subjecting the cells to 
plasma membrane disruption either mechanically with some form of homogeniser or 
sonicator or osmotically (hypoosmotic shock) or through some kind of permeabilisation 
of the plasma membrane with detergents, toxins or electroporation. 
The term "parenteral route of administration" is used to indicate the administration of a 
drug or compound in solution to an animal, such as a mammal or a human, by injection 
or infusion of the drug or compound into the bloodstream of the animal via an injection 
needle iserted into one of the animals blood vessels, preferably a vein. 
The term "oral route of administration" is used to indicate the administration of a drug 
or compound in solution or as a solid to an animal, such as a mammal or a human, by 
placing the drug or compound in the mouth of the animal so that the animal itself can 
swallow the drug or compound or have it delivered to the stomach or intestine by 
intubation. When the drug or compound enters the stomach and intestine it will be taken 
up over the mucosa into the bloodstream and administered via the blood stream to the 
tissues and organs where it is to exert its effect, or it will be acting locally in the 
stomach and intestine. 

The term "pulmonary route of administration" is used to indicate the administration of a 
drug or compound as an aerosol with either solid or liquid particles to an animal, such 
as a mammal or a human, by placing the drug or compound container close to or in 
contact with the mouth and/or nose of the animal so that the animal itself can inhale the 
drug or compound aerosol. When the drug or compound enters the peripheral 
bronchioloi and alveoli it will be taken up over the alveolar membrane, either into the 
bloodstream and administered via the blood stream to the tissues and organs where it is 
to exert its effect or it will act locally in the lungs on lung, vessel and muscle cells as 
well as any other cell type present there. 
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The term "cutaneous route of administration" is used to indicate the administration of a 
drug or compound in solution or as a solid to an animal, such as a mammal or a human, 
by placing the drug or compound on the skin of the animal. The drug can then enter the 
blood vessels under the skin as it is permeaing the skin and thereby be taken up into the 
bloodstream and administered via the blood stream to the tissues and organs where it is 
to exert its effect. It may also exert an effect locally on the site of application on the 
skin. 

The term "rectal route of administration" is used to indicate the administration of a drug 
or compound in solution or as a solid to an animal, such as a mammal or a human, by 
placing the drug or compound in the rectal cavity of the animal. When the drug or 
compound enters the rectum and parts of the large intestine it will be taken up over the 
mucosa into the bloodstream and administered via the blood stream to the tissues and 
organs where it is to exert its effect, or it will act locally in the rectum and parts of the 
large intestine. 

Very many phosphodiesterases (PDE;s) are known. They are grouped in families 
according to functional criteria. Within each family there may be several members - 
isoforms- encoded by different genes. Each isoform may give rise to several splice 
variants. This hierarchy is evidenced at the sequence level: isoforms are more similar to 
each other than to members of other families; splice variants are more similar to each 
other than to other PDE;s. Each specific PDE thus contains sequences that are unique to 
itself, as well as sequences that are shared between isoforms and/or families. 
When setting up a program to identify pharmacological agents that affect the intracellular 
distribution of a target PDE, it is first necessary to choose the target among the many 
PDE;s known. This may be done according to various criteria. A preferred embodiment 
is that the target PDE be present in the tissue or cell type(s) where the pharmacological 
agent is to exert its effect. In another preferred embodiment it is desirable that the target 
not be present in tissues or cell types where no pharmacological effects are desired. 

Establishing the expression patterns of PDE;s in relation to tissues and cell types is best 
done using the methods of detection of mRNA, e.g. Northern analysis, which is a well 
established procedure. Briefly, mRNA isolated from a given source is probed with a 
labelled nucleotide, whose sequence is complementary to the mRNA or a region in a 
mRNA of interest. The assay allows the investigator to determine the stringency of the 
probing, i.e. to correlate the resulting signal(s) with sequence similarities. 
As a first step, the nucleotide sequences of PDE;s are compiled and inspected to identify 
regions that are unique to specific PDE;s as well as regions that are shared among 
several, many, or all PDE;s. Nucleotide sequences may be found in a depository of 
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genetic information, e.g. GenBank, which is a wellknown resource. The inspection of 
the sequences may be aided by using computer programs that were developed to align 
several or many sequences, and in so doing highlighting regions of similarity or lack of 
the same. Many of these are presented and explained in great detail in e.g. Sequence 
Data Analysis Guidebook /edited by S.R.Swindell, Methods in Molecular Biology vol. 
70 (1997), from Humana Press Inc. Totowa, New Jersey, 

When sequences have been identified that are unique to a PDE, or respectively shared 
by several or many PDE;s, oligonucleotide probes based on these sequences may be 
designed and synthesized. The use of such probes to detect mRNA is well established 
in the research community, see e.g. Basic DNA and RNA Protocols/edited by 
A.J.Harwood, Methods in Molecular Biology vol. 58 (1996), from Humana Press Inc. 
Totowa, New Jersey. 

In addition to oligonucleotide probes, mRNA extracted from the tissues and cell types 
of interest is required, preferably in a form ready to use in Northern analysis. Several 
companies offer such material, e.g. Invitrogen and Clontech. Briefly, they provide 
RNA extracted from a great many human and non-human tissues or cell types 
immobilized on membranes, as an array or size-fractionated. 

In a next step, a detectable label needs to be attached to the oligonucleotide probe(s). 
The label is traditionally in the form of a radioactive isotope, but may to advantage be a 
chemiluminescent reagent or a fluorescent agent. See e.g. DNA Probes by Keller and 
Manak (1993), from Macmillan Publishers. Several companies offer reagents to label 
nucleotide probes, e.g. Ambion (Austin, Texas) and Molecular Probes (Eugene, 
Oregon). 

The actual probing procedure involves contacting the immobilized mRNA (s) with the 
probe(s), washing away unbound probe(s) and detecting the signal(s) from the probe(s) 
that bound under the conditions tested, a positive signal indicating that the target(s) of 
the probe(s) was present in the sample(s) subjected to the test. In its simplest form, the 
test is "one-to-one", i.e. each sample of mRNA is exposed to each probe. However, it 
may be advantageous to exploit the sequence hierarchy of the PDE;s, by first probing 
arrays of mRNA from multiple sources with family-specific probes, then examining 
first positives with isotype-specific probes, and then examining the secondary positives 
in detail with very specific probes. One could also multiplex the probing by adding 
different distuingishable fluorescent labels to the probes, thus obtaining information 
from several probes in one experiment. 

The outcome of the analysis is information regarding the expression pattern(s) of 
PDE;s. 

Based on their expression pattern(s) specific PDE;s are then selected for further study, 
and genetic probes are constructed. 
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In general, a genetic probe, i.e. a "GeneX^-GFP fusion or a CFP-^GeneX" fusion, is 
constructed using PCR with "GeneX^-specific primers followed by a cloning step to 
fuse "GeneX" in frame with GFP. The fusion may contain a short vector derived 
sequence between "GeneX" and GFP (e.g. part of a multiple cloning site region in the 
plasmid) resulting in a peptide linker between "GeneX" and GFP in the resulting fusion 
protein. 

The fusion may be made using ploymerase chain reaction techniques, which are 
common laboratory procedures, see e.g. PCR Protocols/edited by B.A.White, Methods 
in Molecular Biology vol. 15 (1993 ), from Humana Press Inc. Totowa, New Jersey. 

In more detail, the steps involved include: 

- Design of gene-specific primers. Inspection of the sequence of the gene allows design 
of gene-specific primers to be used in a PCR reaction. Typically, the top-strand primer 
encompasses the ATG start codon of the gene and the following ca. 20 nucleotides, 
while the bottom-strand primer encompasses the stop codon and the ca. 20 preceding 
nucleotides, if the gene is to be fused behind GFP, i.e. a GFP-"GeneX" fusion. If the 
gene is to be fused in front of GFP, i.e. a "GeneX"-GFP fusion, a stop codon must be 
avoided. Optionally, the full length sequence of GeneX may not be used in the fusion, 
but merely the part which localizes and redistributes like GeneX in response to a signal. 
In addition to gene-specific sequences, the primers contain at least one recognition 
sequence for a restriction enzyme, to allow subsequent cloning of the PCR product. The 
sites are chosen so that they are unique in the PCR product and compatible with sites in 
the cloning vector. Furthermore, it may be necessary to include an exact number of 
nucleotides between the restriction enzyme site and the gene-specific sequence in order 
to establish the correct reading frame of the fusion gene and/or a translation initiation 
concensus sequence. Lastly, the primers always contain a few nucleotides in front of 
the restriction enzyme site to allow efficient digestion with the enzyme. 

-Identifying a source of the gene to be amplified. In order for a PCR reaction to produce 
a product with gene-specific primers, the gene-sequence must initially be present in the 
reaction, e.g. in the form of cDNA. The results of the extensive expression analysis 
performed previously will provide clear information regarding what tissue(s) are useful 
as source material. cDNA libraries from a great variety of tissues or cell types from 
various species are commercially available, e.g. from Clontech (Palo Alto), Stratagene 
(La Jolla) and Invitrogen (San Diego). Many genes are also available in cloned form 
from The American Type Tissue Collection (Virginia). 

- Optimizing the PCR reaction. Several factors are known to influence the efficiency and 
specificity of a PCR reaction, including the annealing temperature of the primers, the 
concentration of ions, notably Mg 2+ and K + , present in the reaction, as well as pH of the 
reaction. If the result of a PCR reaction is deemed unsatisfactory, it might be because 
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the parameters mentioned above are not optimal. Various annealing temperatures should 
be tested, e.g. in a PCR machine with a built-in temperature gradient, available from 
e.g. Stratagene (La Jolla), and/or various buffer compositions should be tried, e.g. the 
OptiPrime buffer system from Stratagene (La Jolla). 

- Cloning the PCR product. The vector into which the amplified gene product will be 
cloned and fused with GFP will already have been taken into consideration when the 
primers were designed. When choosing a vector, one should at least consider in which 
cell types the probe subsequently will be expressed, so that the promoter controlling 
expression of the probe is compatible with the cells. Most expression vectors also 
contain one or more selective markers, e.g. conferring resistance to a drug, which is a 
useful feature when one wants to make stable transfectants. The selective marker should 
also be compatible with the cells to be used. 

The actual cloning of the PCR product should present no difficulty for the person 
skilled in the art as it typically will be a one-step cloning of a fragment digested with 
two different restriction enzymes into a vector digested with the same two enzymes. If 
the cloning proves to be problematic, it may be because the restriction enzymes did not 
work well with the PCR fragment. In this case one could add longer extensions to the 
end of the primers to overcome a possible difficulty of digestion close to a fragment 
end, or one could introduce an intermediate cloning step not based on restriction enzyme 
digestion. Several companies offer systems for this approach, e.g. Invitrogen (San 
Diego) and Clontech (Palo Alto). 

Once the gene has been cloned and, in the process, fused with the GFP gene, the 
resulting product, usually a plasmid, should be carefully checked to make sure it is as 
expected. The most exact test would be to obtain the nucleotide sequence of the fusion- 
gene. 

Once a DNA construct for a probe has been generated, its functionality and usefulness 
may be tested by subjecting it to the following tests: 

- Transfecting it into cells capable of expressing the probe. The fluorescence of the cell 
is inspected soon after, typically the next day. At this point, two features of cellular 
fluorescence are noted: 

- The intensity should usually be at least as strong as that of unfused GFP in the cells. If 
it is not, the sequence or quality of the probe -DNA might be faulty, and should be 
carefully checked. 

-The sub-cellular localization is an indication of whether the probe is likely to perform 
well. 

If it localizes as expected for the gene in question, e.g. is excluded from the nucleus, it 
can immediately go on to a functional test. If the probe is not localized soon after the 
transfection procedure, it may be because of overexpression at this point in time, as the 
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cell typically will have taken of very many copies of the plasmid, and localization will 
occur in time, e.g. within a few weeks, as plasmid copy number and expression level 
decreases. If localization does not occur after prolonged time, it may be because the 
fusion to GFP has destroyed a localization function, e.g. masked a protein sequence 
essential for interaction with its normal cellular anchor-protein. In this case the opposite 
fusion might work, e.g. if GeneX-GFP does not work, GFP-GeneX might, as two 
different parts of GeneX will be affected by the proximity to GFP. If this does not 
work, the proximity of GFP at either end might be a problem, and it could be attempted 
to increase the distance by incorporating a longer linker between GeneX and GFP in the 
DNA construct. 

If there is no prior knowledge of localization, and no localization is observed, it may be 
because the probe should not be localized at this point, because such is the nature of the 
protein fused to GFP. It should then be subjected to a functional test. 
In a functional test, the cells expressing the probe are treated with at least one compound 
known to perturb, usually by activating, the signalling pathway on which the probe is 
expected to report by redistributing itself within the cell. 

If the redistribution is as expected, e.g. if prior knowledge tell that it should translocate 
from location X to location Y, it has passed the first critical test. In this case it can go on 
to further characterization and quantification of the response. 

If it does not perform as expected, it may be because the cell lacks at least one 
component of the signalling pathway, e.g. a cell surface receptor, or there is species 
incompatibility, e.g. if the probe is modelled on sequence information of a human 
geneproduct, and the cell is of hamster origin. In both instances one should identify 
other cell types for the testing process where these potential problems would not apply. 
If there is no prior knowledge about the pattern of redistribution, the analysis of the 
redistribution will have to be done in greater depth to identify what the essential and 
indicative features are, and when this is clear, it can go on to further characterization and 
quantification of the response. 

If no feature of redistribution can be identified, the problem might be as mentioned 
above, and the probe should be retested under more optimal cellular conditions. 

Libraries for cloning of cDNA libraries in the present discovery plan are naturally 
related to the target tissues of the projects. For ultimately finding lead compounds useful 
in the treatment of hyper- and hypotension and erectile dysfunction the cloning libraries 
should preferably be obtained from one or more of the following tissue or cell types: 
vascular smooth muscle, vascular smooth muscle from resistance vessels on the arterial 
side of the vascular system, vascular smooth muscle from capacitance vessels on the 
venous side of the vascular system, vascular smooth muscle cells from small arteries, 
arterioles, venules or veins, smooth vascular cells lines such as T/G HA-VSMCA10 
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and A7r5. For ultimately finding lead compounds useful in the treatment of jet-lag the 
cloning libraries should preferably be obtained neurons form the brain. 
The cells should always be of animal origin, most likely of mammalian origin and 
preferably of human origin. The cells could be derived from normal tissue or from 
tissue of an individual animal having a disease or condition of interest for the project. 
The cells may also be a mortal or immortalised cell line where the initial cell clone has 
been derived from a tissue or cell type as described above. Depending on the discovery 
project the cells of interest for screening assays will vary but may be chosen from the 
above mentioned categories. 

Once a genetic construct containing the protein of interest and the luminophore, from 
here on referred to as "the original fluorescent probe", has been transfected into a 
relevant cell type, as described above under 'preferred cell types for cloning libraries' 
the cells are monitored for the appearance of spatially distributed or randomly 
distributed intracellular fluorescence. Based on prior knowledge regarding the 
distribution of the actual protein different patterns can be expected. If for example 
previous studies have found the protein associated only with the particulate fraction of 
fractionated cells, it can be expected to find a spatial distribution of the original 
fluorescent probe to the plasma membrane, internal membrane/organelle structures or 
structural cytoplasmic elements such as microtubules and microfilaments. If on the other 
hand previous studies report that the protein has been found mostly in the soluble 
fraction of fractionated cells one can expect to find a homogenous or nonhomogenous 
distribution of the original fluorescent probe throughout the cytoplasm and perhaps also 
in the nucleus. For proteins where previous studies have found a mixed localisation to 
both the particulate and soluble fraction of fractionated cells any mixture in the two 
distribution patterns mentioned above for the original fluorescent probe can be expected. 
For proteins where no prior knowledge is at hand a simple cell fractionation and 
Western Blotting can be made, one can use immunohistochemistry of fixed cells of 
relevance or one can decide to rely on the distribution observed for the original 
fluorescent probe. At this stage of the project, a normal distribution pattern of the 
original fluorescent probe may be established after such studies as outlined above. The 
effects of physiologically important and relevant cellular activation on the distributed 
pattern of the original fluorescent probe is also established. It will also become evident 
if the pattern of distribution changes, i.e. if a redistribution of the original fluorescent 
probe occurs as a consequence of applying a physiologically important and relevant 
influence. 

By the methods described in appendix A it may be possible to find chemical entities 
which can interfere with the protein-protein interactions that occur amongst PDE;s and 
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their anchonng/regulating partners, and thereby interfere with the effectiveness of a 
PDE isoform' s action within its cellular environment. This strategy will have different 
effects (and require slightly different discovery methods) depending on the nature of the 
interaction. The possibililties are as follows: 

1 ) A PDE isoform is permanently located at its targeting point, and either remains 
permanently active there, or its activity is modulated in some way by post-translational 
modification such as phosphorylation or by binding of modulators to non-catalytic 
regulatory sites. Dislocation from the targeting site will remove the PDE from a 
localised site of action, and may also lead to inactivation of its inherent catalytic activity. 

2) A PDE isoform is permanently located at its targeting point, and remains inactive 
there until its activity is modulated in some way by post-translational modification, such 
as phosphorylation or by binding of modulators to non-catalytic regulatory sites. 
Dislocation from the targeting site will remove the PDE from a localised site of action, 
and may also lead to activation of its inherent catalytic activity, albeit away from its 
original anchoring site. 

3) A PDE isoform is inactive in its unattached or untargeted form, and when 
activated (as described in "1" above), or partially activated, it redistributes within the 
cell and becomes attached to its targeting site, its activity being restricted to the 
anchoring site and possibly enhanced by interaction with the anchoring protein or some 
associated factor, or at some later time inhibited by the anchoring protein or an 
associated regulatory factor. Any agent which prevents association of the PDE with its 
anchoring or targeting site will prevent it from locating to the preferred site of action, 
and may also prevent the PDE from becoming fully activated by the appropriate 
stimulus whilst in the untargeted state. 

4) A PDE isoform is active in its unattached or untargeted form, and when 
inactivated (as described in "1" above), or partially inactivated, it redistributes within the 
cell and becomes attached to its targeting site, whereby its activity is inhibited by 
interaction with the anchoring protein or an associated regulatory factor. Subsequent 
stimuli may then activate and release the PDE. Any agent which prevents association of 
the PDE with its anchoring or targeting site will prevent it from relocating to the 
anchoring position, and may also prevent the PDE from ever being inactivated. In 
addition, if the PDE cannot target to its anchoring site, it may not be possible 
subsequently to activate the PDE in the appropriate way in the untargeted state. 

When a specific subcellular distribution of a GFP-based PDE probe has been identified, 
it may be advantageous to narrow down which part of the PDE is responsible for this 
effect. The advantage is twofold: It may suggest the design of peptide leads, and it may 
eventually aid in defining the binding partner. Knowledge of both partners involved in 
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specific binding may aid in the selection of compound libraries to screen for inhibition 
of the specific binding. 

To identify the region of the PDE involved in specific binding, one may make GFP- 
based fusions with progressively shorter parts of the PDE, and examine the cellular 
distribution of these constructs. If there is prior knowledge of functional domains, one 
may start with the domain believed to confer specific binding to a subcellular structure. 
The generation of constructs to test may consist of selecting a particular part of the PDE 
to fuse to GFP, or it may involve the generation of in-frame deletions in the PDE part of 
the fusion. Both approaches have been widely used in molecular genetic studies. 
When a region has been identified that appears responsible for conferring a specific 
subcellular distribution upon a PDE, the amino acid residues most important for this 
trait may be identified by a more detailed analysis, e.g. substituting them one by one 
with e.g. an alanine residue, a so called Ala-scan, which also has been used extensively 
in molecular genetic studies. 

To identify the identity of the cellular protein partaking in the specific distribution of the 
PDE, one may exploit the knowledge about the region of the PDE responsible for the 
subcellular distribution, e.g. one may use the region of the PDE as bait in a genetic two 
hybrid screen to pull out its binding partner. Several companies offer two hybrid 
systems, e.g. Life Technologies. 

The knowledge about the normal distribution of the original fluorescent probe is used to 
establish which part or which parts of the terminal (or entire) amino-acid sequence that 
is important for the attachment of this fluorescent probe to subcellular structures, giving 
it its specific spatially distributed pattern in the cell or cells, when such a pattern has 
been established as the normal distribution of this fluorescent probe. This may be 
accomplished by creating new fluorescent probes where a systematic deletion of short 
N- or C-terminal or internal sequences (number of DNA bases) of the original 
fluorescent probe are made. These new shorter variants of the of the original 
fluorescent probe construct are transfected into the cells of interest and then the cells are 
examined for spatial distribution of the new fluorescent probes as described above for 
the original fluorescent probe. In those cells where the new fluorescent probe 
distribution pattern is different from the original fluorescent probe distribution pattern it 
is evident that part of the, or the entire, targeting sequence has been deleted. The DNA- 
or amino-acid sequence of the missing part therefore contains the structural information 
necessary for association of the original fluorescent probe with its intracellular binding 
sites. 

Peptides for inhibition of the established normal distribution of the original fluorescent 
probe are made according to the following hypothesis, that the deduced targeting 
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sequence, or sequences, in the original fluorescent probe amino-acid sequence are the 
important sequences for the actual spatial distribution of the original fluorescent probe 
in intact living cells, is tested. This is done by producing peptides of identical amino- 
acid sequence as the deduced targeting sequence or parts thereof and introducing them 
into the cytoplasm, either by microinjection or transient or permanent permeabilisation, 
of cells containing the original fluorescent probe and thereafter monitoring the spatial 
distribution of the original fluorescent probe in the cells. If the deduced targeting 
sequence or sequences are of importance for the actual spatial distribution of the original 
fluorescent probe in intact living cells, the introduced peptides will self-associate with 
the anchoring sites for the original fluorescent probe and thereby disrupt the normal 
distribution of the original fluorescent probe. In order to have this effect, the 
introduction of the peptides should change the original distribution pattern so that a 
decrease in fluorescence of 10% or more, compared to the pattern before their 
introduction, can be detected. This is done by observing the same cells before and after 
administration of the peptides. When peptides that fulfil this criterion have been found 
they are called 'peptide leads' and will hereafter be referred to using this expression. 
These peptide leads can now be used as a basis for the design of organic molecules that 
can be used eventually to disrupt the spatial distribution of the original fluorescent probe 
but also as control compounds in screening assays. 

In parallel to the above mentioned procedure wherein peptide leads are defined, the 
distribution pattern found for the original fluorescent probe is compared to the naturally 
occurring spatial distribution of the protein on which the original fluorescent probe is 
based. This may be accomplished by fixation of primary cells separated or within the 
tissue of interest and fixation of cells that contain the original fluorescent probe. 
Thereafter the protein is stained using ordinary immunocytochemical or 
immunohistochemical methods and the spatial distribution revealed by this staining 
procedure is compared to the spatial distribution of the original fluorescent probe. It is 
desirable, but not required, that a high degree of correlation between the two patterns 
obtained in this step can be observed. 

The establishment of a primary screening assay is normally done by making use of the 
cells of interest containing the original fluorescent probe as the basis for a screening 
assay. Depending on the knowledge acquired about the behaviour of the original 
fluorescent probe when subjecting the cells to physiologically relevant influences the 
assay procedure can be chosen: 1. If the fluorescent probe normally is targeted to 
specific sites and stay associated with these sites during stimulation of the intracellular 
pathway the assay should preferably be designed to detect dislocation of the original 
fluorescent probe from the targeting sites in mechanically intact or permeabilised living 
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cells. This is an assay where the dislocation can be detected within minutes after 
application of an influence and the time frame for the detection and time for exposing the 
cells to an influence should be chosen to match this. 2. If the desire is to disrupt the 
actual targeting event rather than dislocate already targeted fluorescent probe the 
influence may need hours to produce a detectable response. The actual measurement, 
still of a change in the fluorescence or luminescence distribution pattern compared to the 
normal distribution pattern for the original fluorescent probe, may be made at two time 
points; before and after the influence has exerted any effect it may have. This is an assay 
where the effect of an influence may require several hours to produce a detectable 
response and the time frame for the detection and time for exposing the cells to an 
influence should be chosen to match this. 3. If the fluorescent probe normally 
redistributes between two intracellular sites upon activation of the intracellular pathway 
one may either want to disrupt the initial targeting or dislocate the original fluorescent 
probe from its initial or resting anchoring site. In this case procedure no. 1 above may 
be used. If the desire instead is to inhibit the association of the original fluorescent 
probe with the site it redistributes to during activation of the intracellular pathway the 
targeting sequence of this site should be in focus for the lead peptide generation. This is 
an assay where the redistribution may be detected within minutes after application of an 
influence and the time frame for the detection and time for exposing the cells to an 
influence should be chosen to match this. Furthermore, any influence applied to inhibit 
the targeting of the original fluorescent probe upon its redistribution may need to be 
added to the cells before activation of the intracellular pathway. 

While the original fluorescent probe and peptide leads will be used in the actual primary 
screening assay, it is also desirable to have a counterscreen or counterscreens directed at 
protein isoforms that one does not wish to affect. In order to accomplish this, constructs 
are made for new fluorescent probes encoding the protein isoforms tagged with GFP. 
These constructs are subsequently transfected into the cells of interest. When the new 
fluorescent probes are expressed in the cells, some of the cells are chosen as the basis 
for new cell lines that can be used in the counterscreen or counterscreens. 
Suitable probes for this purpose comprise DNA constructs encoding fusion 
polypeptides comprising forms of PDE 1, PDE 2, PDE 5, PDE 6 or PDE 9 and GFP. 
In a preferred embodiment the DNA constructs will encode fusion polypeptides 
comprising isoforms of mPDE 5 and GFP. 

It may be desirable to establish peptide leads that more specifically dislocate the desired 
isoform of the protein of interest compared to other isoforms of the same protein. In 
such a case, both the cell lines established for the primary screen and the counterscreen 



22132DK1 



28 



or counterscreens are used. The peptide leads are introduced into the cells as described 
above and the changes in spatial distribution of the original and counterscreen 
fluorescent probes are quantified and dose-response relationships are established for 
each lead peptide. Thereafter the dose-response relationships are compared. A peptide 
lead is considered specific for the original fluorescent probe if the dose of the peptide 
required to dislocate at least 10% of the fluorescent probes in the counterscreen or 
conterscreens are at least two times higher than the dose required to dislocate 10% of the 
original fluorescent probe. The lead peptides with the biggest dose difference when 
comparing the primary and the counterscreen dose-response relationships are chosen as 
the basis for the next step in the discovery project. 

In one embodiment the primary screening assay and counterscreen or counterscreens are 
used to define specificity of the peptide leads by using a procedure that compares their 
ability to cause a dislocation, disruption of targeting or inhibition of redistribution of the 
original fluorescent probe in the primary screening assay to their ability to cause a 
dislocation, disruption of targeting or inhibition of redistribution of the new fluorescent 
probes in the counterscreen or counterscreens. 

In a preferred embodiment the dose of a peptide lead required to cause a quantified 
dislocation, disruption of targeting or inhibition of redistribution of the original 
fluorescent probe of at least 10% in the primary screening assay is 50% or less of the 
dose required to cause a quantified dislocation, disruption of targeting or inhibition of 
redistribution of the new fluorescent probes of at least 10% in the counterscreen or 
counterscreens. 

The invention provides for a specificity index which may be constructed describing a 
numerical relationship, with the primary screening asay result first, of the dose required 
to produce half-maximal effect in the primary assay compared to the dose required to 
produce half-maximal effect in the counterscreen or counterscreens. 
In one embodiment the peptide leads chosen for further use in the discovery project 
have a specificity index of 1 to 2. 

In another embodiment the peptide leads chosen for further use in the discovery project 
have a specificity index between 1 to 2 and 1 to 10. 

In a further embodiment the peptide leads chosen for further use in the discovery project 
have a specificity index between 1 to 1 1 and 1 to 100. 

In yet a further preferred embodiment the peptide leads chosen for further use in the 
discovery project have a specificity index better than 1 to 100. 

The lead peptides may be used to create and select libraries of small organic molecules 
that can be useful in screening assays to find bioactive substances useful as drugs to 
treat the condition or disease of interest for the project. In order to accomplish this, the 
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amino-acid sequence information and other structural information about the lead peptide 
or peptides is used to extract information useful for finding and/or defining and 
synthesising bioactive organic molecules that can mimic the effect of the lead peptides 
on the normal spatial distribution pattern of the original fluorescent probe. Peptide leads 
selected by the discovery project are used to design and assemble compound libraries 
based on the structural and chemical information inherent in the lead peptides using 
prior chemical knowledge and computational chemistry approaches so that the 
compounds have a structure that give them the ability to interact with or bind to the 
targeting sequence of mPDE 5, thereafter testing the compound libraries at a 
concentration of 10 or 100 micromolar of each compound in the primary screening 
assay. 

When the libraries of compounds have been defined and are at hand it is time to initiate 
primary screening. In this procedure, cells containing the original fluorescent probe are 
contacted with the compounds. The compounds are all tested at just one or a few 
concentrations, typically 10 and 100 micromolar, in a highly parallel fashion using a 
quantitative fluorescence redistribution assay. Compounds that cause a change in the 
quantitated response (the response scale defined by the range 0 (no change in 
redistribution) - 100%) of the assay by more than a predetermined value, typically 
between 10 and 100%, are considered to be "primary hits". The primary hits are then 
further characterised: 1. for potency by establishing a dose-response relationship 
compared to the lead peptide(s) using the primary screening assay 2. for selectivity by 
establishing a dose-response relationship in the counterscreen or counterscreens. 
Primary hits that have low potency, typically when the half-maximal effect of the 
compound in the primary assay is achieved at a concentration of the compound between 
10 and 100 micromolar, may not need testing in the counterscreen or counterscreens 
since the likelihood that they will be used beyond this step in the discovery project is 
small. Primary hits that have equal or lower potency in the primary screening assay 
compared to the counterscreen or counterscreens are regarded as non-selective and the 
likelihood that they will be used beyond this step in the discovery project is small. 
Primary hits that display some degree of selectivity, typically half maximal effect in the 
primary screening assay at a concentration 50% or less of the concentration that gives 
half maximal effect in the counterscreen or counterscreens are considered interesting as 
the basis for further chemical synthesis or construction of new libraries of compounds 
and will hereafter be referred to as "primary lead compounds". 

Compounds that cause a change in the quantitated response, with a response scale from 
0 to 100% based on the absence of a response and the maximal response observed with 
the peptide leads in the primary screening assay, of the assay by more than a 
predetermined value are selected and called "primary hits". 
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In one embodiment the predetermined value is 10%. 
In another embodiment the predetermined value is 50%. 
In yet another embodiment the predetermined value is 70%. 

In one embodiment the primary hits are further characterised for potency (as defined 
herein) and maximal effect by establishing a dose-response relationship (as defined 
herein) and comparing that to the effects of the lead peptides using the primary 
screening assay and for selectivity (as defined herein) by establishing a dose-response 
relationship in the counterscreen or counterscreens. 

Primary hits may be deselected by the discovery project when they display a half- 
maximal potency at a dose corresponding to a concentration of more than 10 micromolar 
or because they display a selectivity index less than 1 to 2. 

Primary hits may be selected by the discovery project when they display a half-maximal 
potency at a dose corresponding to a concentration of 10 micromolar or less or because 
they display a selectivity index higher than 1 to 2, the compounds hereafter also referred 
to as "primary lead compounds". 

To further improve the possibilities of finding bioactive compounds with desirable 
properties for treatment of the diseases or conditions of interest to the project one may at 
this stage build a Structure-Activity Relationship (SAR) by iterations of compound 
library composition and screening to define drug candidate leads. The primary lead 
compounds are here used to provide chemical structural information that can be used as 
the basis for composition or chemical synthesis of new, directed, compound libraries. 
By systematic chemical modification of part of the structure of one or more primary lead 
compounds new libraries are assembled. These new libraries of compounds are also 
investigated using the primary screening assay and counterscreen or counterscreens. 
Preferably, dose-response relationships are recorded for each chemical modification of 
the primary lead compound and compared to the primary lead compound itself. Thereby 
SAR is established. Among the new compounds, the ones that in this step has the best 
combination of potency and specificity are chosen either as the basis for a new round of 
compound library synthesis or composition or, as the final step of the SAR building 
process, as compounds that will be further for actual pharmacoloical effects in assay 
systems and animals that are relevant to the underlying physiological and 
patophysiological processes of interest to the project. The latter compounds will 
hereafter be referred to as "drug candidate leads". 

In one embodiment drug candidate leads have a half-maximal potency at a dose 
corresponding to a concentration of less than 1 micromolar and a selectivity index 
higher than 1 to 2. 
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In one embodiment the drug candidate leads have a half-maximal potency at a dose 
corresponding to a concentration of less than 1 micromolar and a selectivity index 
higher than 1 to 10. 

In one embodiment the drug candidate leads have a half-maximal potency at a dose 
corresponding to a concentration of less than 1 micromolar and a selectivity index 
higher than 1 to 100. 

In one embodiment the drug candidate leads have a half-maximal potency at a dose 
corresponding to a concentration of less than 0,1 micromolar and a selectivity index 
higher than 1 to 2. 

In a preferred embodiment the drug candidate leads have a half-maximal potency at a 
dose corresponding to a concentration of less than 0,1 micromolar and a selectivity 
index higher than 1 to 10. 

In another preferred embodiment the drug candidate leads have a half-maximal potency 
at a dose corresponding to a concentration of less than 0,1 micromolar and a selectivity 
index higher than 1 to 100. 

To validate the drug candidate leads in vitro is often preferred at this stage. In this case, 
drug candidate leads may be further characterised in tissue based, cell based and 
biochemical assays for efficacy and toxicity. There are many ways to test efficacy of a 
drug candidate lead. Preferably, the drug candidate lead is tested in assay systems with 
high relevance to the underlying physiological and patophysiological processes involved 
in the patogenesis and patophysiology of the disease or condition of interest to the 
project. Likewise, the drug candidate leads are tested for toxic effects, preferably testing 
for genetic effects (influence on the integrity and arrangement of DNA), metabolic 
effects (influence on cellular metabolic processes) and cytotoxic effects (influence on 
cell integrity and organelle integrity). There is a high likelihood that drug candidate 
leads, that do not show appropriate efficacy or that display toxicity will not be used 
beyond this step in the discovery project because it is expected that such compounds are 
less suitable as actual drugs to be used in an animal. 

In one embodiment drug candidate leads chosen by the discovery project are tested in 
vitro for efficacy (as defined herein), in assay systems with high degree of relevance to 
the underlying physiological and patophysiological processes involved in hypotension, 
and for toxicity, preferably testing for genetic, metabolic and cytotoxic effects, 
whereafter the drug candidate leads that display the best efficacy and the least, or no, 
indications of toxicity are chosen to be the candidates that will enter testing in animals. 
In another embodiment drug candidate leads chosen by the discovery project are tested 
in vitro for efficacy, in assay systems with high degree of relevance to the underlying 
physiological and patophysiological processes involved in hypertension, and for 
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toxicity, preferably testing for genetic, metabolic and cytotoxic effects, whereafter the 
drug candidate leads that display the best efficacy and the least, or no, indications of 
toxicity are chosen to be the candidates that will enter testing in animals. 
In another embodiment drug candidate leads chosen by the discovery project are tested 
in vitro for efficacy, in assay systems with high degree of relevance to the underlying 
physiological and patophysiological processes involved in jet-lag, and for toxicity, 
preferably testing for genetic, metabolic and cytotoxic effects, whereafter the drug 
candidate leads that display the best efficacy and the least, or no, indications of toxicity 
are chosen to be the candidates that will enter testing in animals. 

In another embodiment drug candidate leads chosen by the discovery project are tested 
in vitro for efficacy, in assay systems with high degree of relevance to the underlying 
physiological and patophysiological processes involved in erectile dysfunction, and for 
toxicity, preferably testing for genetic, metabolic and cytotoxic effects, whereafter the 
drug candidate leads that display the best efficacy and the least, or no, indications of 
toxicity are chosen to be the candidates that will enter testing in animals. 

Finally, it is mostly likely preferred to do an in vivo validation of the drug candidate 
leads. In this case the drug candidate leads are tested for toxic and unwanted effects in 
animals such as mice and rats. The drug candidate leads are also tested for efficacy in 
animals that have a disease or condition with high degree of relevance to the disease or 
condition of interest to the project. The drug candidate leads may also be tested for 
efficacy in animals which have been treated in a way that make them experience a 
disease or condition with high degree of relevance to the disease or condition of interest 
to the project. Drug candidate leads that display efficacy in one or more of such animal 
tests and that does not display any apparent toxicity at a dosage level, preferably 2 -10 
times higher than the level that gives satisfactory efficacy are chosen to be the final drug 
candidates that should be considered for further animal testing and initial testing in 
humans. These compounds are hereafter referred to as "discovery project leads". 

In one embodiment drug candidate leads chosen by the discovery project are tested for 
efficacy, in healthy animals and animals with a condition with high degree of relevance 
to the underlying physiological and pathophysiological processes involved in 
hypotension, and for toxicity and unwanted side effects, after which the drug candidate 
leads that display the best efficacy and the least, or no, indications of toxicity or 
unwanted side effects are chosen to be the candidates, called discovery project leads, 
that will enter further testing in animals and testing in humans. 

In one embodiment drug candidate leads chosen by the discovery project are tested for 
efficacy, in healthy animals and animals with a condition with high degree of relevance 
to the underlying physiological and pathophysiological processes involved in 
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hypertension, and for toxicity and unwanted side effects, after which the drug 
candidate leads that display the best efficacy and the least, or no, indications of toxicity 
or unwanted side effects are chosen to be the candidates, called discovery project leads, 
that will enter further testing in animals and testing in humans. 

In one embodiment drug candidate leads chosen by the discovery project are tested for 
efficacy, in healthy animals and animals with a condition with high degree of relevance 
to the underlying physiological and pathophysiological processes involved in jet-lag, 
and for toxicity and unwanted side effects, after which the drug candidate leads that 
display the best efficacy and the least, or no, indications of toxicity or unwanted side 
effects are chosen to be the candidates, called discovery project leads, that will enter 
further testing in animals and testing in humans. 

In one embodiment drug candidate leads chosen by the discovery project are tested for 
efficacy, in healthy animals and animals with a condition with high degree of relevance 
to the underlying physiological and pathophysiological processes involved in erectile 
dysfunction, and for toxicity and unwanted side effects, after which the drug candidate 
leads that display the best efficacy and the least, or no, indications of toxicity or 
unwanted side effects are chosen to be the candidates, called discovery project leads, 
that will enter further testing in animals and testing in humans. 

The administration route of any of the compounds of the invention may be of any 
suitable route which leads to a concentration in the blood corresponding to a therapeutic 
concentration by the oral route, the parenteral route, the cutaneous route, the nasal 
route, the rectal route, the vaginal route and the ocular route. It should be clear to a 
person skilled in the art that the administration route is dependant on the compound in 
question, particularly, the choice of administration route depends on the physico- 
chemical properties of the compound together with the age and weight of the patient and 
on the particular disease and the severity of the same. 

The compounds of the invention may be contained in any appropriate amount in a 
pharmaceutical composition, and are generally contained in an amount of about 1-95% 
by weight of the total weight of the composition. The composition may be in form of, 
e.g., tablets, capsules, pills, powders, granulates, suspensions, emulsions, solutions, 
gels including hydrogels, pastes, ointments, creams, plasters, drenches, delivery 
devices, suppositories, enemas, injectables, implants, sprays, aerosols and in other 
suitable form. The pharmaceutical compositions may be formulated according to 
conventional pharmaceutical practice, see, e.g., "Remington's Pharmaceutical Sciences" 
and "Encyclopedia of Pharmaceutical Technology". 

Pharmaceutical compositions according to the present invention may be formulated to 
release the active compound substantially immediately upon administration or at any 
substantially predetermined time or time period after administration. The latter type of 
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compositions are generally known as controlled release formulations. Controlled 
release formulations may also be denoted "sustained release", "prolonged release", 
"programmed release", "time release", "rate-controlled" and/or "targeted release" 
formulations. 

In the present context every pharmaceutical composition is an actual drug delivery 
system, since upon administration it presents the active drug substance to the body of 
the organism. 

The compounds of the invention are preferably administered in an amount of about 0.1- 
30 mg per kg body weight per day, such as about 0.5-15 mg per kg body weight per 
day. The compound in question may be administered orally in the form of tablets, cap- 
sules, elixirs or syrups, or rectally in the form of suppositories. Parenteral 
administration of the compounds of the invention, is suitably performed in the form of 
saline solutions of the compounds or with the compound incorporated into liposomes. 
In cases where the compound in itself is not sufficiently soluble to be dissolved, an acid 
addition salt of a basic compound can be used, or a solubilizer such as ethanol can be 
applied. 

Oral administration. For compositions adapted for oral administration for systemic use, 
the dosage is normally 1 mg to 1 g per dose administered 1-4 times daily for 1 week, 12 
months or even lifelong depending on the disease to be treated. 

Rectal administration. For compositions adapted for rectal a somewhat higher amount of 
compound is usually preferred, i.e. from approximately 1 mg to 100 mg per kg body 
weight per day. 

Parenteral administration. For parenteral administration a dose of about 0.1 mg to about 
50 mg per kg body weight per day is convenient. For intravenous administration a dose 
of about 0.1 mg to about 20 mg per kg body weight per day. For intraarticular 
administration a dose of about 0.1 mg to about 20 mg per kg body weight per day is 
usually preferable. For parenteral administration in general, a solution in an aqueous 
medium of 0.5-2% or more of the active ingredients may be employed. 
Percutaneous administration . For topical administration on the skin a dose of about 1 
mg to about 5 g administered 1-10 times daily is usually preferable. 

EXAMPLES 

Probes for detection of PDE5 dislocation: 
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These are specific PDE5 variants fused to a GFP. Currently only one main human 
variant is known (GenBank Acc.nos. AJ004865 and D89094). 

Inspection of the scientific litterature indicates that the catalytic domain is contained in the C-terminal 
part of the protein, so potential targeting sequences of PDE5 may be located in the N-terminal part. 
To best preserve the normal distribution of PDE5, the first fusion is made between the C-terminus of 
the PDE5 species and the N-terminal of the GFP. 

To construct the PDE5-GFP fusions, PDE5 sequences are amplified using 
PCR according to standard protocols with the specific primers listed below. The PCR 
product is digested with restriction enzymes EcoRl and Acc65I, and ligated into 
pEGFP-Nl (Clontech, Palo Alto; GenBank Accession number U55762) digested with 
EcoRl and Acc65I. This produces a PDE5-EGFP fusion under the control of a CMV 
promoter (SEQ.ID.NO 1&2). 

The top primer includes specific sequences following the ATG, a Kozak sequence, and 
a cloning site (EcoRl). The bottom primer includes specific C-terminal sequences 
minus the stop codon, an Acc65I cloning site, and two extra nucleotides to preserve the 
reading frame in EGFP-N1. 

PDE5-top (SEQ.ID.NO. 3): 

5 ' -GTG A ATTC A ACC ATGG AGCGGGCC-3 ' 

PDE5-bottom (SEQ.ID.NO. 4): 

5 ' -GTGGTACCC AGTTCCGCTTGGCC 

The resulting plasmids are transfected into a suitable cell line, e.g. MVLEC. The 
subcellular distribution of the probes is examined carefully by fluorescence microscopy, 
both under resting conditions, and upon elevation of cGMP, e.g. by activation of 
cyclase with NO or nitroprusside, which may or may not have an effect on the normal 
distribution. 
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CLAIMS 

1 . A method for preventing or treating, in an animal in need thereof, an adverse 
condition which may be reduced or abolished by modulating the activity of one or more 
cyclic nucleotide phosphodiesterases having the ability to cleave cyclic GMP, the 
method comprising modulating the specific effectiveness of the cyclic nucleotide 
phosphodiesterase by modulating their spatial distribution within cells of the animal. 

2. A method according to claim 1, wherein the cyclic nucleotide phosphodiesterase is 
selected from the group consisting of PDE 1 , PDE 2, PDE 5, PDE 6, and PDE 9. 

3. A method according to claim 1 or 2, wherein the cyclic nucleotide phosphodiesterase 
is a PDE 5 species. 

4. A method according to claim 3, wherein the PDE 5 species is a mPDE 5 species. 

5. A method according to any of claims 1-4, wherein the animal is a mammal. 

6. A method according to claim 5, wherein the mammal is a human being. 

7. A method according to any of claims 1-6, wherein the modulation of the specific 
effectiveness of the cyclic nucleotide phosphodiesterase is a dislocation from a native 
location within the cell. 

8. A method according to any of claims 1-6, wherein the modulation of the specific 
effectiveness of the cyclic nucleotide phosphodiesterase involves a disruption of the 
targeting of the cyclic nucleotide phosphodiesterase to a native location within the cell. 

9. A method according to any of claims 1-6, wherein the modulation of the specific 
effectiveness of the cyclic nucleotide phosphodiesterase involves interference with 
the redistribution of the cyclic nucleotide phosphodiesterase, the redistribution being 
associated with an increase or a decrease in the specific effectiveness of the cyclic 
nucleotide phosphodiesterase. 

10. A method according to any of claims 1-9, wherein the adverse condition is 
hypotension. 
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14. A method according to any of claims 1-9, wherein the adverse condition is 
hypertension. 

15. A method according to any of claims 1-9, wherein the adverse condition is cardiac 
failure or trombosis. 

16. A method according to any of claims 1-9, wherein the adverse condition is erectile 
dysfunction. 

18. A method according to any of claims 1-9, wherein the adverse condition is 
jet lag. 

19. A method according to any of claims 1-9, wherein the adverse condition is oedema 
caused by inflammation. 

20. A method according to any of the preceding claims, wherein the modulation of the 
specific effectiveness of the cyclic nucleotide phosphodiesterase is performed by 
exposing cells, in the animal in which dislocation, disruption of targeting, or 
interference with redistribution of a cyclic nucleotide phosphodiesterase may take place, 
to the influence of a substance which modulates the spatial distribution of the cyclic 
nucleotide phosphodiesterase in the cells. 

21. A method according to claim 20, wherein the substance is one which, in a 
quantitative fluorescence redistribution assay designed to monitor dislocation of cyclic 
nucleotide phosphodiesterase, causes dislocation of at least 10% of otherwise natively 
located cyclic nucleotide phosphodiesterase within the cell at a concentration of the 
substance of 100 micromolar. 

22. A method according to claim 21, wherein at least 50% of otherwise natively located 
cyclic nucleotide phosphodiesterase is dislocated within the cell at a concentration of the 
substance of 100 micromolar.. 

23. A method according to claim 21, wherein at least 70% of otherwise natively located 
cyclic nucleotide phosphodiesterase is dislocated within the cell at a concentration of the 
substance of 100 micromolar. 

24. A method according to claim 21, wherein at least 90% of otherwise natively located 
cyclic nucleotide phosphodiesterase is dislocated within the cell at a concentration of the 
substance of 100 micromolar. 
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25. A method according to claim 20, wherein the substance is one which, in a 
quantitative fluorescence redistribution assay, designed to monitor targeting of cyclic 
nucleotide phosphodiesterase, reduces targeting of the cyclic nucleotide 
phosphodiesterase to its native location within the cell by at least 10% at a concentration 
of the substance of 100 micromolar. 

26. A method according to claim 25, wherein the substance reduces targeting of the 
cyclic nucleotide phosphodiesterase to its native location within the cell by at least 50% 
at a concentration of the substance of 100 micromolar. 

27. A method according to claim 25, wherein the substance reduces targeting of the 
cyclic nucleotide phosphodiesterase to its native location within the cell by at least 70% 
at a concentration of the substance of 100 micromolar. 

28. A method according to claim 25, wherein the substance reduces targeting of the 
cyclic nucleotide phosphodiesterase to its native location within the cell by at least 90% 
at a concentration of the substance of 100 micromolar. 

29. A method according to claim 20, wherein the substance is one which, in a 
quantitative fluorescence redistribution assay, designed to monitor changes in 
redistribution caused by an influence, causes a reduction in the induced redistribution by 
at least 10% of the normal maximum redistribution at a concentration of the substance of 
100 micromolar. 

30. A method according to claim 29, wherein the substance causes a reduction in the 
induced redistribution of the cyclic nucleotide phosphodiesterase by at least 50% of the 
normal maximum redistribution at a concentration of the substance of 100 micromolar. 

31. A method according to claim 29, wherein the substance causes a reduction in the 
induced redistribution of the cyclic nucleotide phosphodiesterase by at least 70% of the 
normal maximum redistribution at a concentration of the substance of 100 micromolar. 

32. A method according to claim 29, wherein the substance causes a reduction in the 
induced redistribution of the cyclic nucleotide phosphodiesterase by at least 90% of the 
normal maximum redistribution at a concentration of the substance of 100 micromolar. 

33. A method according to any of claims 20-32, wherein the substance is an organic 
compound having a molecular weight of at the most 1200 Da. 
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34. A method according to any of claims 20-32, wherein the substance is an organic 
compound having a molecular weight of at the most 900 Da. 

35. A method according to any of claims 20-32, wherein the substance is an organic 
compound having a molecular weight of at the most 600 Da. 

36. A method according to any of claims 20-32, wherein the substance is an organic 
compound having a molecular weight of at the most 300 Da. 

37. A method according to any of claims 20-36, wherein the substance is a peptide. 

38. A method according to any of claim 20-36, wherein the substance is a carbon- 
containing non-peptide. 

39. A method according to any of claims 20-38, wherein the organic compound is a 
compound having one or more chemical domains capable of interacting with one or 
more functional groups of the targeting sequence of the native anchoring site of the 
cyclic nucleotide phosphodiesterase. 

40. A method according to claim 39, wherein the organic compound is a compound 
having at least two chemical domains capable of interacting with at least two functional 
groups of the targeting sequence of the native anchoring site for the cyclic nucleotide 
phosphodiesterase. 

41 . A method according to claim 39, wherein the organic compound is a compound 
having at least three chemical domains capable of interacting with at least three 
functional groups of the targeting sequence of the native anchoring site for the cyclic 
nucleotide phosphodiesterase. 

42. A method according to any of claims 20-38, wherein the organic compound is a 
compound having one or more chemical domains capable of interacting with one or 
more functional groups of the targeting sequence of the cyclic nucleotide 
phosphodiesterase. 

43. A method according to claim 42, wherein the organic compound is a compound 
having at least two chemical domains capable of interacting with at least two functional 
groups of the targeting sequence of the cyclic nucleotide phosphodiesterase. 
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44 A method according to claim 42, wherein the organic compound is a compound 
having at least three chemical domains capable of interacting with at least three 
functional groups of the targeting sequence of the cyclic nucleotide phosphodiesterase. 

45. A method according to any of claims 20-44, wherein the organic compound is a 
weak acid in that it is a neutral molecule that can reversible dissociate into an anion (a 
negatively charged molecule) and a proton (a hydrogen ion). 

46. A method according to claims 20-44, wherein the organic compound is a weak base 
in that it is a neutral molecule that can form a cation (a positively charged molecule) by 
combining with a proton (a hydrogen ion ). 

47. A method according to any of claims 39-46, wherein the functional groups of the 
targeting sequences include functional groups selected from the group consisting of: 
methyl-, isopropyl-, isobutyl-, hydroxyl-, thiol-, benzyl-, benzyloyl-, methylindolyl-, 
methylimidazolyl-, amine-, imine-, carboxyl- and acetamide-groups as parts of amino 
acids in the targeting sequences. 

48. A method according to any of claims 20-47, wherein the exposure of the animal to 
the influence of a substance is performed by administering an effective amount of the 
substance to the animal. 

49. A method according to claim 48, wherein the exposure of the animal to the influence 
of the substance is performed by administering an effective amount of the substance via 
the intravenous route of administration to the animal. 

50. A method according to claim 48, wherein the exposure of the animal to the influence 
of the substance is performed by administering an effective amount of the substance via 
the oral route of administration to the animal. 

51. A method according to claim 48, wherein the exposure of the animal to the influence 
of the substance is performed by administering an effective amount of the substance via 
the pulmonary route of administration to the animal. 

52. A method according to claim 48, wherein the exposure of the animal to the influence 
of the substance is performed by administering an effective amount of the substance via 
the rectal route of administration to the animal. 
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53. A method according to claim 48, wherein the exposure of the animal to the influence 
of the substance is performed by administering an effective amount of the substance via 
the transdermal route of administration to the animal. 

54. A method according to any of claims 21 - 53, wherein the quantitative fluorescence 
redistribution assay consists of cells selected from the group consisting of 

vascular smooth muscle cells such as vascular smooth muscle from resistance vessels 
on the arterial side of the vascular system, vascular smooth muscle from capacitance 
vessels on the venous side of the vascular system, vascular smooth muscle cells from 
small arteries, arterioles, venules, veins, any neuron from the brain, cariomyocytes, 
vascular endothelial cells and immortal cells lines of these cells, and smooth vascular 
cells lines such as T/G HA-VSMCA10 and A7r5, transfected with a nucleotide 
construct encoding a fluorescent probe comprising as the biologically active polypeptide 
either splice variant mPDE 5, or PDE splice variant cloned from vascular smooth 
muscle cells such as vascular smooth muscle from resistance vessels on the arterial side 
of the vascular system, vascular smooth muscle from capacitance vessels on the venous 
side of the vascular system, vascular smooth muscle cells from small arteries, arterioles, 
venules, veins, any neuron from the brain, or cariomyocytes, transfected in such a way, 
that the construct is expressed by the cells. 

55. A method according to claim 54, wherein the quantitative fluorescence redistribution 
assay is a primary screening assay used in a discovery project 

56. A method according to any of claim 54 or 55, wherein the cells are derived from an 
animal. 

57. A method according to claim 56, wherein the cells are derived from a mammal such 
as a human. 

58. A method according to any of claims 54-57, wherein the fluorescent probe 
redistributes after the cells have been subjected to a physiologically important and 
relevant influence that is relevant to the intercellular signalling pathway wherein the PDE 
splice variant is an integral part, so that both the normal pattern of spatial distribution 
and possible redistribution of the fluorescent probe can be established. 

59. A method according to claim 58 wherein the intracellular signalling pathway 
comprises a cellular response that modulates the generation of cGMP and the spatially 
distributed cytoplasmic levels of cGMP. 
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60. A method according to any of claims 58 or 59, wherein the fluorescent probe is 
modified in a systematic way, still keeping the GFP coding sequence intact, so that the 
new fluorescent probes are fusion polypeptides where parts of the suspected targeting 
sequences of mPDE 5 are altered. 

61. A method according to claim 60, wherein the modification of the suspected targeting 
sequence of the mPDE 5 or a splice variant is a deletion. 

62. A method according to any of claims 60 or 61, wherein the spatial distribution of 
the fluorescent probe is compared to the spatial distribution of the unmodified 
fluorescent probe deducing the targeting sequence. 

63. A method according to any of claims 20-62, wherein the substance interacts with 
the targeting sequence or part thereof in a manner that dislocates, disrupts targeting, or 
interferes with redistribution of the fluorescent probe as measured in quantitative 
fluorescence redistribution assay. 
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ABSTRACT 

This application describes a method by which to identify novel chemical entities that 
may modulate the specific effectiveness of the cyclic nucleotide phosphodiesterases 
(PDEs) having the ability to cleave cGMP. The preferred mode of action is dislocation, 
disruption of targeting or interference with redistribution of specific isoforms or splice 
variants of PDES from their anchoring sites within cells, thereby modulating their 
specific effectiveness, not their enzymatic capacity. The chemical entities may be useful 
in preventing or treating, in an animal, preferably a human, in need thereof, an adverse 
condition which may be reduced or abolished by modulating the specific effectiveness 
of PDE5. Examples of such adverse conditions hypo- and hypertension, erectile 
dysfunction and jet-lag. 
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Modtaget PD 
1 5 OKI, J9gg 

SEQUENCE LISTING 

{1) GENERAL INFORMATION 
( i ) APPLICANT : NovoNordisk , Biolmage 

(ii) TITLE OF THE INVENTION: A method for preventing or treating adverse 

conditions which may be reduced or abolished by modulating the activity 
of one or more phosphodiesterases having the ability to cleave cGMP. 

(iii) NUMBER OF SEQUENCES : 4 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: NovoNordisk, Bio Image 

(B) STREET: Morkho jbygade 28 

(C) CITY: Soborg 

(D) STATE: DK 

(E) COUNTRY: DENMARK 

(F) ZIP: 2860 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE : FastSEQ for Windows Version 2.0 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: , PV&P R 

(B) REGISTRATION NUMBER: 

(C) REFERENCE /DOCKET NUMBER: 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3381 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. . .3378 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



ATG GAG CGG GCC GGC CCC AGC TTC GGG CAG CAG CGA CAG CAG CAG CAG 
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Met Glu Arg Ala Gly Pro Ser Fhe Gly Gin Gin Arg Gin Gin Gin Gin 
15 10 15 

CCC CAG CAG CAG AAG CAG CAG CAG AGG GAT CAG GAC TCG GTC GAA GCA 96 
Pro Gin Gin Gin Lys Gin Gin Gin Arg Asp Gin Asp Ser Val Glu Ala 
20 25 30 

TGG CTG GAC GAT CAC TGG GAC TTT ACC TTC TCA TAC TTT GTT AGA AAA 144 
Trp Leu Asp Asp His Trp Asp Phe Thr Phe Ser Tyr Phe Val Arg Lys 
35 40 45 

GCC ACC AGA GAA ATG GTC AAT GCA TGG TTT GCT GAG AGA GTT CAC ACC 192 
Ala Thr Arg Glu Met Val Asn Ala Trp Phe Ala Glu Arg Val His Thr 
50 55 60 

ATC CCT GTG TGC AAG GAA GGT ATC AGA GGC CAC ACC GAA TCT TGC TCT 240 
He Pro Val Cys Lys Glu Gly He Arg Gly His Thr Glu Ser Cys Ser 
65 70 75 80 

TGT CCC TTG CAG CAG AGT CCT CGT GCA GAT AAC AGT GTC CCT GGA ACA 2 88 

Cys Pro Leu Gin Gin Ser Pro Arg Ala Asp Asn Ser Val Pro Gly Thr 
85 90 95 

CCA ACC AGG AAA ATC TCT GCC TCT GAA TTT GAC CGG CCT CTT AGA CCC 336 
Pro Thr Arg Lys He Ser Ala Ser Glu Phe Asp Arg Pro Leu Arg Pro 
100 105 HO 

ATT GTT GTC AAG GAT TCT GAG GGA ACT GTG AGC TTC CTC TCT GAC TCA 384 
He Val Val Lys Asp Ser Glu Gly Thr Val Ser Phe Leu Ser Asp Ser 
115 120 125 

GAA AAG AAG GAA CAG ATG CCT CTA ACC CCT CCA AGG TTT GAT CAT GAT 432 
Glu Lys Lys Glu Gin Met Pro Leu Thr Pro Pro Arg Phe Asp His Asp 
130 135 140 

GAA GGG GAC CAG TGC TCA AGA CTC TTG GAA TTA GTG AAG GAT ATT TCT 480 
Glu Gly Asp Gin Cys Ser Arg Leu Leu Glu Leu Val Lys Asp He Ser 
145 150 155 160 

AGT CAT TTG GAT GTC ACA GCC TTA TGT CAC AAA ATT TTC TTG CAT ATC 528 
Ser His Leu Asp Val Thr Ala Leu Cys His Lys He Phe Leu His He 
165 170 175 

CAT GGA CTG ATA TCT GCT GAC CGC TAT TCC CTG TTC CTT GTC TGT GAA 576 
His Gly Leu He Ser Ala Asp Arg Tyr Ser Leu Phe Leu Val Cys Glu 
180 185 190 

GAC AGC TCC AAT GAC AAG TTT CTT ATC AGC CGC CTC TTT GAT GTT GCT 624 
Asp Ser Ser Asn Asp Lys Phe Leu He Ser Arg Leu Phe Asp Val Ala 
195 200 205 

GAA GGT TCA ACA CTG GAA GM GTT TCA AAT AAC TGT ATC CGC TTA GAA 672 
Glu Gly Ser Thr Leu Glu Glu Val Ser Asn Asn Cys He Arg Leu Glu 
210 215 220 
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TGG AAC AAA GGC ATT GTG GGA CAT GTG GCA GCG CTT GGT GAG CCC TTG 720 
Trp Asn Lys Gly lie Val Gly His Val Ala Ala Leu Gly Glu Pro Leu 
225 230 235 240 

AAC ATC AAA GAT GCA TAT GAG GAT CCT CGG TTC AAT GCA GAA GTT GAC 7 68 

Asn lie Lys Asp Ala Tyr Glu Asp Pro Arg Phe Asn Ala Glu Val Asp 
245 250 255 

CAA ATT ACA GGC TAC AAG ACA CAA AGC ATT CTT TGT ATG CCA ATT AAG 816 
Gin lie Thr Gly Tyr Lys Thr Gin Ser He Leu Cys Met Pro He Lys 
260 265 270 

AAT CAT AGG GAA GAG GTT GTT GGT GTA GCC CAG GCC ATC AAC AAG AAA 864 
Asn His Arg Glu Glu Val Val Gly Val Ala Gin Ala He Asn Lys Lys 
275 280 285 

TCA GGA AAC GGT GGG ACA TTT ACT GAA AAA GAT GAA AAG GAC TTT GCT 912 
Ser Gly Asn Gly Gly Thr Phe Thr Glu Lys Asp Glu Lys Asp Phe Ala 
290 295 300 

GCT TAT TTG GCA TTT TGT GGT ATT GTT CTT CAT AAT GCT CAG CTC TAT 960 
Ala Tyr Leu Ala Phe Cys Gly He Val Leu His Asn Ala Gin Leu Tyr 
305 310 315 320 

GAG ACT TCA CTG CTG GAG AAC AAG AGA AAT CAG GTG CTG CTT GAC CTT 1008 
Glu Thr Ser Leu Leu Glu Asn Lys Arg Asn Gin Val Leu Leu Asp Leu 
325 330 335 

GCT AGT TTA ATT TTT GAA GAA CAA CAA TCA TTA GAA GTA ATT TTG AAG 1056 
Ala Ser Leu He Phe Glu Glu Gin Gin Ser Leu Glu Val He Leu Lys 
340 345 350 

AAA ATA GCT GCC ACT ATT ATC TCT TTC ATG CAA GTG CAG AAA TGC ACC 1104 
Lys lie Ala Ala Thr He He Ser Phe Met Gin Val Gin Lys Cys Thr 
355 360 365 

ATT TTC ATA GTG GAT GAA GAT TGC TCC GAT TCT TTT TCT AGT GTG TTT 1152 
He Phe He Val Asp Glu Asp Cys Ser Asp Ser Phe Ser Ser Val Phe 
370 375 380 

CAC ATG GAG TGT GAG GAA TTA GAA AAA TCA TCT GAT ACA TTA ACA AGG 1200 
His Met Glu Cys Glu Glu Leu Glu Lys Ser Ser Asp Thr Leu Thr Arg 
385 390 395 400 

GAA CAT GAT GCA AAC AAA ATC AAT TAC ATG TAT GCT CAG TAT GTC AAA 1248 
Glu His Asp Ala Asn Lys He Asn Tyr Met Tyr Ala Gin Tyr Val Lys 
405 410 415 

AAT ACT ATG GAA CCA CTT AAT ATC CCA GAT GTC AGT AAG GAT AAA AGA 1296 
Asn Thr Met Glu Pro Leu Asn He Pro Asp Val Ser Lys Asp Lys Arg 
420 425 430 

TTT CCC TGG ACA ACT GAA AAT ACA GGA AAT GTA AAC CAG CAG TGC ATT 1344 



Phe Pro Trp Thr Thr Glu Asn Thr Gly Asn Val Asn Gin Gin Cys lie 
435 440 445 

AGA AGT TTG CTT TGT ACA CCT ATA AAA AAT GGA AAG AAG AAT AAA GTT 13 92 

Arg Ser Leu Leu Cys Thr Pro lie Lys Asn Gly Lys Lys Asn Lys Val 
450 455 460 

ATA GGG GTT TGC CAA CTT GTT AAT AAG ATG GAG GAG AAT ACT GGC AAG 1440 

lie Gly Val Cys Gin Leu Val Asn Lys Met Glu Glu Asn Thr Gly Lys 

465 470 475 480 

GTT AAG CCT TTC AAC CGA AAT GAC GAA GAG TTT CTG GAA GCT TTT GTC 1488 

Val Lys Pro Phe Asn Arg Asn Asp Glu Gin Phe Leu Glu Ala Phe Val 
485 490 495 

ATC TTT TGT GGC TTG GGG ATC CAG AAC ACG CAG ATG TAT GAA GCA GTG 1536 

lie Phe Cys Gly Leu Gly lie Gin Asn Thr Gin Met Tyr Glu Ala Val 
500 505 510 

GAG AGA GCC ATG GCC AAG CAA ATG GTC ACA TTG GAG GTT CTG TCG TAT 1584 

Glu Arg Ala Met Ala Lys Gin Met Val Thr Leu Glu Val Leu Ser Tyr 
515 520 525 

CAT GCT TCA GCA GCA GAG GAA GAA ACA AGA GAG CTA CAG TCG TTA GCG 1632 

His Ala Ser Ala Ala Glu Glu Glu Thr Arg Glu Leu Gin Ser Leu Ala 
530 535 540 

GCT GCT GTG GTG CCA TCT GCC CAG ACC CTT AAA ATT ACT GAC TTT AGC 1680 

Ala Ala Val Val Pro Ser Ala Gin Thr Leu Lys lie Thr Asp Phe Ser 

545 550 555 560 

TTC AGT GAC TTT GAG CTG TCT GAT CTG GAA ACA GCA CTG TGC ACA ATT 1728 

Phe Ser Asp Phe Glu Leu Ser Asp Leu Glu Thr Ala Leu Cys Thr lie 
565 570 575 

CGG ATG TTT ACT GAC CTC AAC CTT GTG CAG AAC TTC CAG ATG AAA CAT 1776 

Arg Met Phe Thr Asp Leu Asn Leu Val Gin Asn Phe Gin Met Lys His 
580 585 590 

GAG GTT CTT TGC AGA TGG ATT TTA AGT GTT AAG AAG AAT TAT CGG AAG 1824 

Glu Val Leu Cys Arg Trp lie Leu Ser Val Lys Lys Asn Tyr Arg Lys 
595 600 605 

AAT GTT GCC TAT CAT AAT TGG AGA CAT GCC TTT AAT ACA GCT CAG TGC 1872 

Asn Val Ala Tyr His Asn Trp Arg His Ala Phe Asn Thr Ala Gin Cys 
610 615 620 

ATG TTT GCT GCT CTA AAA GCA GGC AAA ATT CAG AAC AAG CTG ACT GAC 1920 

Met Phe Ala Ala Leu Lys Ala Gly Lys lie Gin Asn Lys Leu Thr Asp 

625 630 635 640 

CTG GAG ATA CTT GCA TTG CTG ATT GCT GCA CTA AGC CAC GAT TTG GAT 1968 

Leu Glu lie Leu Ala Leu Leu lie Ala Ala Leu Ser His Asp Leu Asp 
645 650 655 
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CAC CGT GGT GTG AAT AAC TCT TAC ATA CAG CGA AGT GAA CAT CCA CTT 2 016 
His Arg Gly Val Asn Asn Ser Tyr lie Gin Arg Ser Glu His Pro Leu 
660 665 670 

GCC CAG CTT TAC TGC CAT TCA ATC ATG GAA CAC CAT CAT TTT GAC CAG 2 064 
Ala Gin Leu Tyr Cys His Ser lie Met Glu His His His Phe Asp Gin 
675 680 685 

TGC CTG ATG ATT CTT AAT AGT CCA GGC AAT CAG ATT CTC AGT GGC CTC 2112 
Cys Leu Met lie Leu Asn Ser Pro Gly Asn Gin lie Leu Ser Gly Leu 
690 695 700 

TCC ATT GAA GAA TAT AAG ACC ACG TTG AAA ATA ATC AAG CAA GCT ATT 2160 
Ser He Glu Glu Tyr Lys Thr Thr Leu Lys He He Lys Gin Ala He 
705 710 715 720 

TTA GCT ACA GAC CTA GCA CTG TAC ATT AAG AGG CGA GGA GAA TTT TTT 22 08 
Leu Ala Thr Asp Leu Ala Leu Tyr He Lys Arg Arg Gly Glu Phe Phe 
725 730 735 

GAA CTT ATA AGA AAA AAT CAA TTC AAT TTG GAA GAT CCT CAT CAA AAG 2256 
Glu Leu He Arg Lys Asn Gin Phe Asn Leu Glu Asp Pro His Gin Lys 
740 745 750 

GAG TTG TTT TTG GCA ATG CTG ATG ACA GCT TGT GAT CTT TCT GCA ATT 2304 
Glu Leu Phe Leu Ala Met Leu Met Thr Ala Cys Asp Leu Ser Ala He 
755 760 765 

ACA AAA CCC TGG CCT ATT CAA CAA CGG ATA GCA GAA CTT GTA GCA ACT 2352 
Thr Lys Pro Trp Pro lie Gin Gin Arg He Ala Glu Leu Val Ala Thr 
770 775 780 

GAA TTT TTT GAT CAA GGA GAC AGA GAG AGA AAA GAA CTC AAC ATA GAA 2400 
Glu Phe Phe Asp Gin Gly Asp Arg Glu Arg Lys Glu Leu Asn He Glu 
785 790 795 800 

CCC ACT GAT CTA ATG AAC AGG GAG AAG AAA AAC AAA ATC CCA AGT ATG 2448 
Pro Thr Asp Leu Met Asn Arg Glu Lys Lys Asn Lys He Pro Ser Met 
805 810 815 

CAA GTT GGG TTC ATA GAT GCC ATC TGC TTG CAA CTG TAT GAG GCC CTG 2496 
Gin Val Gly Phe He Asp Ala lie Cys Leu Gin Leu Tyr Glu Ala Leu 
820 825 830 

ACC CAC GTG TCA GAG GAC TGT TTC CCT TTG CTA GAT GGC TGC AGA AAG 2544 
Thr His Val Ser Glu Asp Cys Phe Pro Leu Leu Asp Gly Cys Arg Lys 
835 840 845 

AAC AGG CAG AAA TGG CAG GCC CTT GCA GAA CAG CAG GAG AAG ATG CTG 2592 
Asn Arg Gin Lys Trp Gin Ala Leu Ala Glu Gin Gin Glu Lys Met Leu 
850 855 860 

ATT AAT GGG GAA AGC GGC CAG GCC AAG CGG AAC TGG GTA CCG CGG GCC 2640 



6 



lie Asn Gly Glu Ser Gly Gin Ala Lys Arg Asn Trp Val Pro Arg Ala 
865 870 875 880 

CGG GAT CCA CCG GTC GCC ACC ATG GTG AGC AAG GGC GAG GAG CTG TTC 2688 
Arg Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe 
885 890 895 

ACC GGG GTG GTG CCC ATC CTG GTC GAG CTG GAC GGC GAC GTA AAC GGC 27 3 6 
Thr Gly Val Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly 
900 905 910 

CAC AAG TTC AGC GTG TCC GGC GAG GGC GAG GGC GAT GCC ACC TAC GGC 2784 
His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly 
915 920 925 

AAG CTG ACC CTG AAG TTC ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC 2832 
Lys Leu Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro 
930 935 940 

TGG CCC ACC CTC GTG ACC ACC CTG ACC TAC GGC GTG CAG TGC TTC AGC 2880 
Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser 
945 950 955 960 

CGC TAC CCC GAC CAC ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG 2928 
Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met 
965 970 975 

CCC GAA GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC 2976 
Pro Glu Gly Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly 
980 985 990 

AAC TAC AAG ACC CGC GCC GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG 3024 
Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val 
995 1000 1005 

AAC CGC ATC GAG CTG AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC 3072 
Asn Arg He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He 
1010 1015 1020 

CTG GGG CAC AAG CTG GAG TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC 3120 
Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He 
1025 1030 1035 1040 

ATG GCC GAC AAG CAG AAG AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC 3168 
Met Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg 
1045 1050 1055 

CAC AAC ATC GAG GAC GGC AGC GTG CAG CTC GCC GAC CAC TAC CAG CAG 3216 
His Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin 
1060 1065 1070 

AAC ACC CCC ATC GGC GAC GGC CCC GTG CTG CTG CCC GAC AAC CAC TAC 32 64 

Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr 
1075 1080 1085 



CTG AGC ACC CAG TCC GCC CTG AGC AAA 
Leu Ser Thr Gin Ser Ala Leu Ser Lys 
1090 1095 

CAC ATG GTC CTG CTG GAG TTC GTG ACC 
His Met Val Leu Leu Glu Phe Val Thr 
1105 1110 

ATG GAC GAG CTG TAC AAG TAA 
Met Asp Glu Leu Tyr Lys 
1125 



GAC CCC AAC GAG AAG CGC GAT 3312 
Asp Pro Asn Glu Lys Arg Asp 
1100 

GCC GCC GGG ATC ACT CTC GGC 3 3 60 

Ala Ala Gly lie Thr Leu Gly 
1115 1120 

3381 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1126 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Glu Arg Ala Gly Pro Ser Phe Gly Gin Gin Arg Gin Gin Gin Gin 

15 10 15 

Pro Gin Gin Gin Lys Gin Gin Gin Arg Asp Gin Asp Ser Val Glu Ala 

20 25 30 

Trp Leu Asp Asp His Trp Asp Phe Thr Phe Ser Tyr Phe Val Arg Lys 

35 40 45 

Ala Thr Arg Glu Met Val Asn Ala Trp Phe Ala Glu Arg Val His Thr 

50 55 60 

lie Pro Val Cys Lys Glu Gly lie Arg Gly His Thr Glu Ser Cys Ser 
65 70 75 80 

Cys Pro Leu Gin Gin Ser Pro Arg Ala Asp Asn Ser Val Pro Gly Thr 

85 90 95 

Pro Thr Arg Lys lie Ser Ala Ser Glu Phe Asp Arg Pro Leu Arg Pro 

100 105 110 

lie Val Val Lys Asp Ser Glu Gly Thr Val Ser Phe Leu Ser Asp Ser 

115 120 125 

Glu Lys Lys Glu Gin Met Pro Leu Thr Pro Pro Arg Phe Asp His Asp 

130 135 140 

Glu Gly Asp Gin Cys Ser Arg Leu Leu Glu Leu Val Lys Asp lie Ser 
145 150 155 160 

Ser His Leu Asp Val Thr Ala Leu Cys His Lys lie Phe Leu His lie 

165 170 175 

His Gly Leu lie Ser Ala Asp Arg Tyr Ser Leu Phe Leu Val Cys Glu 

180 185 190 

Asp Ser Ser Asn Asp Lys Phe Leu lie Ser Arg Leu Phe Asp Val Ala 

195 200 205 

Glu Gly Ser Thr Leu Glu Glu Val Ser Asn Asn Cys lie Arg Leu Glu 



210 

Trp Asn Lys Gly 
225 

Asn He Lys Asp 

Gin He Thr Gly 
260 

Asn His Arg Glu 
275 

Ser Gly Asn Gly 
290 

Ala Tyr Leu Ala 
305 

Glu Thr Ser Leu 

Ala Ser Leu He 
340 

Lys He Ala Ala 
355 

He Phe He Val 
370 

His Met Glu Cys 
385 

Glu His Asp Ala 

Asn Thr Met Glu 
420 

Phe Pro Trp Thr 
435 

Arg Ser Leu Leu 
450 

He Gly Val Cys 
465 

Val Lys Pro Phe 

He Phe Cys Gly 
500 

Glu Arg Ala Met 
515 

His Ala Ser Ala 

530 

Ala Ala Val Val 
545 

Phe Ser Asp Phe 

Arg Met Phe Thr 
580 

Glu Val Leu Cys 
595 

Asn Val Ala Tyr 
610 

Met Phe Ala Ala 
625 

Leu Glu He Leu 



215 

He Val Gly His 
230 

Ala Tyr Glu Asp 
245 

Tyr Lys Thr Gin 

Glu Val Val Gly 
280 

Gly Thr Phe Thr 
295 

Phe Cys Gly He 
310 

Leu Glu Asn Lys 
325 

Phe Glu Glu Gin 

Thr He He Ser 
360 

Asp Glu Asp Cys 
375 

Glu Glu Leu Glu 
390 

Asn Lys He Asn 
405 

Pro Leu Asn He 

Thr Glu Asn Thr 
440 

Cys Thr Pro He 
455 

Gin Leu Val Asn 
470 

Asn Arg Asn Asp 
485 

Leu Gly He Gin 

Ala Lys Gin Met 
520 

Ala Glu Glu Glu 
535 

Pro Ser Ala Gin 
550 

Glu Leu Ser Asp 
565 

Asp Leu Asn Leu 

Arg Trp He Leu 
600 

His Asn Trp Arg 
615 

Leu Lys Ala Gly 
630 

Ala Leu Leu He 



220 

Val Ala Ala Leu 
235 

Pro Arg Phe Asn 
250 

Ser He Leu Cys 
265 

Val Ala Gin Ala 

Glu Lys Asp Glu 
300 

Val Leu His Asn 
315 

Arg Asn Gin Val 
330 

Gin Ser Leu Glu 
345 

Phe Met Gin Val 

Ser Asp Ser Phe 
380 

Lys Ser Ser Asp 
395 

Tyr Met Tyr Ala 
410 

Pro Asp Val Ser 
425 

Gly Asn Val Asn 

Lys Asn Gly Lys 
460 

Lys Met Glu Glu 
475 

Glu Gin Phe Leu 
490 

Asn Thr Gin Met 
505 

Val Thr Leu Glu 

Thr Arg Glu Leu 
540 

Thr Leu Lys He 
555 

Leu Glu Thr Ala 
570 

Val Gin Asn Phe 
585 

Ser Val Lys Lys 

His Ala Phe Asn 
620 

Lys He Gin Asn 
635 

Ala Ala Leu Ser 



Gly Glu Pro Leu 
240 

Ala Glu Val Asp 
255 

Met Pro He Lys 
270 

He Asn Lys Lys 
285 

Lys Asp Phe Ala 

Ala Gin Leu Tyr 
320 

Leu Leu Asp Leu 
335 

Val He Leu Lys 
350 

Gin Lys Cys Thr 
365 

Ser Ser Val Phe 

Thr Leu Thr Arg 
400 

Gin Tyr Val Lys 
415 

Lys Asp Lys Arg 
430 

Gin Gin Cys He 
445 

Lys Asn Lys Val 

Asn Thr Gly Lys 
480 

Glu Ala Phe Val 
495 

Tyr Glu Ala Val 
510 

Val Leu Ser Tyr 
525 

Gin Ser Leu Ala 

Thr Asp Phe Ser 
560 

Leu Cys Thr He 
575 

Gin Met Lys His 
590 

Asn Tyr Arg Lys 
605 

Thr Ala Gin Cys 

Lys Leu Thr Asp 
640 

His Asp Leu Asp 



645 650 655 

His Arg Gly Val Asn Asn Ser Tyr lie Gin Arg Ser Glu His Pro Leu 

660 665 670 

Ala Gin Leu Tyr Cys His Ser lie Met Glu His His His Phe Asp Gin 

675 680 685 

Cys Leu Met lie Leu Asn Ser Pro Gly Asn Gin lie Leu Ser Gly Leu 

690 695 700 

Ser lie Glu Glu Tyr Lys Thr Thr Leu Lys lie lie Lys Gin Ala lie 
705 710 715 720 

Leu Ala Thr Asp Leu Ala Leu Tyr lie Lys Arg Arg Gly Glu Phe Phe 

725 730 735 

Glu Leu He Arg Lys Asn Gin Phe Asn Leu Glu Asp Pro His Gin Lys 

740 745 750 

Glu Leu Phe Leu Ala Met Leu Met Thr Ala Cys Asp Leu Ser Ala He 

755 760 765 

Thr Lys Pro Trp Pro He Gin Gin Arg He Ala Glu Leu Val Ala Thr 

770 775 780 

Glu Phe Phe Asp Gin Gly Asp Arg Glu Arg Lys Glu Leu Asn lie Glu 
785 790 795 800 

Pro Thr Asp Leu Met Asn Arg Glu Lys Lys Asn Lys He Pro Ser Met 

805 810 815 

Gin Val Gly Phe He Asp Ala He Cys Leu Gin Leu Tyr Glu Ala Leu 

820 825 830 

Thr His Val Ser Glu Asp Cys Phe Pro Leu Leu Asp Gly Cys Arg Lys 

835 840 , 845 

Asn Arg Gin Lys Trp Gin Ala Leu Ala Glu Gin Gin Glu Lys Met Leu 

850 855 860 

He Asn Gly Glu Ser Gly Gin Ala Lys Arg Asn Trp Val Pro Arg Ala 
865 870 875 880 

Arg Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe 

885 890 895 

Thr Gly Val Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly 

900 ' 905 910 

His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly 

915 920 925 

Lys Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro 

930 935 940 

Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser 
945 950 955 960 

Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met 

965 970 975 

Pro Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly 

980 985 990 

Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val 

995 1000 1005 

Asn Arg He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He 

1010 1015 1020 

Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He 
025 1030 1035 1040 

Met Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg 

1045 1050 1055 

His Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin 

1060 1065 1070 

Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr 



1075 1080 1085 

Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp 

1090 1095 1100 

His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly 
105 HIO 1H5 H20 

Met Asp Glu Leu Tyr Lys 
1125 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GTGAATTCAA CCATGGAGCG GGCC 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GTGGT AC C C A GTTCCGCTTG GCC 



Modtaget PO 

22IMDK ' ' t 5 OKI, 1998 

AN IMPROVED METHOD for extracting quantitative information relating to an 
influence on a cellular response. 



SUMMARY OF THE INVENTION 

5 The present invention relates to an improved method and tools for extracting quantitative 
information relating to an influence on a cellular response, in particular an influence 
caused by contacting or incubating the cell with a substance influencing a cellular 
response, wherein the cellular response is manifested in redistribution of at least one 
component in the cell. In particular, the invention relates to an improved method for 

10 extracting the quantitative information relating to an influence on an intracellular pathway 
involving redistribution of at least one component associated with the pathway. The 
method of the invention may be used as a very efficient procedure for testing or 
discovering the influence of a substance on a physiological process, for example in 
connection with screening for new drugs, testing of substances for toxicity, identifying 

15 drug targets for known or novel drugs. In particular, the present invention relates to an 
improved method for parallelisation of the testing procedure so that a large number of 
substances can be tested simultaneously using commercially available instrumentation. 
The invention also describes several ways of contacting the cells with a substance 
influencing a cellular response and modifications made to the actual cells before, during or 

20 after contacting the cells with these substances as to improve the applicability and use of 
the method for extracting quantitative information relating to influence on an intracellular 
pathway in a highly parallel fashion. Other valuable uses of the method and technology of 
the invention will be apparent to the skilled person on the basis of the following disclosure. 
In a particular embodiment of the invention, the present invention relates to a method of 

25 detecting intracellular translocation or redistribution of biologically active polypeptides, 
preferably an enzyme, affecting intracellular processes, and a DNA construct and a cell for 
use in the method. 

Two appendices are included herein, and are considered part of the application. Appendix 
I, "METHOD AND APPARATUS FOR HIGH DENSITY FORMAT SCREENING FOR 
30 BIOACTIVE MOLECULES", is a pending patent application. Appendix II, "CHANGES 
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IN INTRACELLULAR cAMP VISUALIZED USING A cAMP-DEPENDENT PROTEIN 
KINASE-GREEN FLUORESCENT PROTEIN HYBRID", is a manuscript intended for 
publication. 

5 BACKGROUND OF THE INVENTION 

Intracellular pathways are tightly regulated by a cascade of components that undergo 
modulation in a temporally and spatially characteristic manner. Several disease states can 
be attributed to altered activity of individual signalling components (i.e. protein kinases, 
protein phosphatases, transcription factors). These components therefore render 
10 themselves as attractive targets for therapeutic intervention. 

Protein kinases and phosphatases are well described components of several intracellular 
signalling pathways. The catalytic activity of protein kinases and phosphatases are 
assumed to play a role in virtually all regulatable cellular processes. Although the 
involvement of protein kinases in cellular signalling and regulation have been subjected to 
15 extensive studies, detailed knowledge on e.g. the exact timing and spatial characteristics of 
signalling events is often difficult to obtain due to lack of a convenient technology. 

Novel ways of monitoring specific modulation of intracellular pathways in intact, living 
cells is assumed to provide new opportunities in drug discovery, functional genomics, 
toxicology, patient monitoring etc. 

20 The spatial orchestration of protein kinase activity is likely to be essential for the high 
degree of specificity of individual protein kinases. The phosphorylation mediated by 
protein kinases is balanced by phosphatase activity. Also within the family of phosphatases 
translocation has been observed, e.g. translocation of PTP2C to membrane ruffles 
[(Cossette et al 1996)], and likewise is likely to be indicative of phosphatase activity. 

25 Protein kinases often show a specific intracellular distribution before, during and after 
activation. Monitoring the translocation processes and/or redistribution of individual 
protein kinases or subunits thereof is thus likely to be indicative of their functional 
activity. A connection between translocation and catalytic activation has been shown for 
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protein kinases like the diacyl glycerol (DAG)-dependent protein kinase C (PKC), the 
cAMP-dependent protein kinase (PKA) [(DeBernardi et al 1996)] and the mitogen- 
activated-protein kinase Erk-1 [(Sano et al 1995)]. 

Commonly used methods of detection of intracellular localisation/activity of protein 
5 kinases and phosphatases are immunoprecipitation, Western blotting and 
immunocytochemical detection. 

Taking the family of diacyl glycerol (DAG)-dependent protein kinase Cs (PKCs) as an 
example, it has been shown that individual PKC isoforms that are distributed among 
different tissues and cells have different activator requirements and undergo differential 

10 translocation in response to activation. Catalytically inactive DAG-dependent PKCs are 
generally distributed throughout the cytoplasm, whereas they upon activation translocate 
to become associated with different cellular components, e.g. plasma membrane [(Farese, 
1992),(Fulop Jr. et al 1995)] nucleus [(Khalil et al 1992)], cytoskeleton [(Blobe et 
al 1996)]. The translocation phenomenon being indicative of PKC activation has been 

15 monitored using different approaches: a) immunocytochemistry where the localisation of 
individual isoforms can be detected after permeabilisation and fixation of the cells [(Khalil 
et al 1992)]; and b) tagging all DAG-dependent PKC isoforms with a fluorescently 
labelled phorbol myristate acetate (PMA) [(Godson et al 1996)]; and c) chemical tagging 
of PKC pi with the fluorophore Cy3 [(Bastiaens & Jovin 1996)] and d) genetic tagging of 

20 PKC a([Schmidt et al 1997]) and of PKC y and PKC 5 [(Sakai et al 1996)]. The first 
method does not provide dynamic information whereas the latter methods will. Tagging 
PKC with fluorescently labelled phorbol myristate acetate cannot distinguish between 
different DAG-dependent isoforms of PKC but will label and show movement of all 
isoforms. Chemical and genetic labelling of specific DAG-dependent PKCs confirmed that 

25 they in an isoform specific manner upon activation move to cell periphery or nucleus. 

In an alternative method, protein kinase A activity has been measured in living cells by 
chemical labelling one of the kinase's subunit [(Adams et al 1991)]. The basis of the 
methodology is that the regulatory and catalytic subunit of purified protein kinase A is 
labelled with fluorescein and rhodamine, respectively. At low cAMP levels protein kinase 
30 A is assembled in a heterotetrameric form which enables fluorescence resonance energy 
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transfer between the two fluorescent dyes. Activation of protein kinase A leads to 
dissociation of the complex, thereby eliminating the energy transfer. A disadvantage of 
this technology is that the labelled protein kinase A has to be microinjected into the cells 
of interest. This highly invasive technique is cumbersome and not applicable to large scale 
5 screening of biologically active substances. A further disadvantage of this technique as 

compared to the presented invention is that the labelled protein kinase A cannot be inserted 
into organisms/animals as a transgene. 

Recently it was discovered that Green Fluorescent Protein (GFP) expressed in many 
different cell types, including mammalian cells, became highly fluorescent [(Chalfie et 

10 al 1994)]. WO95/07463 describes a cell capable of expressing GFP and a method for 
detecting a protein of interest in a cell based on introducing into a cell a DNA molecule 
having DNA sequence encoding the protein of interest linked to DNA sequence encoding a 
GFP such that the protein produced by the DNA molecule will have the protein of interest 
fused to the GFP, then culturing the cells in conditions permitting expression of the fused 

15 protein and detecting the location of the fluorescence in the cell, thereby localizing the 

protein of interest in the cell. However, examples of such fused proteins are not provided, 
and the use of fusion proteins with GFP for detection or quantitation of translocation or 
redistribution of biologically active polypeptides affecting intracellular processes upon 
activation, such as proteins involved in signalling pathways, e.g. protein kinases or 

20 phosphatases, has not been suggested. WO 95/07463 further describes cells useful for the 
detection of molecules, such as hormones or heavy metals, in a biological sample, by 
operatively linking a regulatory element of the gene which is affected by the molecule of 
interest to a GFP, the presence of the molecules will affect the regulatory element which in 
turn will affect the expression of the GFP. In this way the gene encoding GFP is used as a 

25 reporter gene in a cell which is constructed for monitoring the presence of a specific 
molecular identity. 

Green Fluorescent Protein has been used in an assay for the detection of translocation of 
the glucocorticoid receptor (GR) [(Carey, KL et al 1996)]. A GR-S65TGFP fusion has 
been used to study the mechanisms involved in translocation of the glucocorticoid receptor 
30 (GR) in response to the agonist dexamethasone from the cytosol, where it is present in the 
absence of a ligand, through the nuclear pore to the nucleus where it remains after ligand 
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binding. The use of a GR-GFP fusion enables real-time imaging and quantitation of 
nuclear/cytoplasmic ratios of the fluorescence signal. A similar genetic construct has been 
used to follow and quantify dexamethasone induced translocation of GR to the nucleus in 
HeLa cells [(Guiliano, K.A et al 1997)] in a system called Array Scan™ (WO 97/45730) 
5 designed for automated drug screening. Recently, several other investigators have 
demonstrated that tagging a specific protein (or part of a protein) involved in an 
intracellular signalling pathway with GFP provides a new means to measure and quantify 
the influence of substances on this pathway. The concept has been shown to work both for 
cytoplasmic to nuclear translocation of the androgen receptor [(Georget V et al 1997)] and 

10 transcription factors such as NF-ATc [(Beals CR et al 1997)] in analogy with what has 
already been described for GR above. Another relevant example is a p-arrestin - GFP 
construct that was shown to report on activation of G-protein coupled receptors by 
translocating from the cytosol to the plasma membrane [(Barak LS et al 1997)]. Finally, it 
has also been demonstrated that attaching GFP to a smaller part of a protein like the 

15 pleckstrin homology domain of phospholipase C 5 1 [(Stauffer TP et al 1998)] and a 
cysteine-rich domain of PKC y [(Oancea E et al 1998)] can be used to report on an 
influence from a substance by quantifying their redistribution within the cells during 
activation of the specific signalling pathway to which they belong. 

Many currently used screening programmes designed to find compounds that affect protein 
20 kinase activity are based on measurements of kinase phosphorylation of artificial or natural 
substrates, receptor binding and/or reporter gene expression. The interest in fluorescence 
measurements as the basis for future high-throughput drug screening has however 
increased dramatically over the last few years [(Silverman L et al 1998)]. Of particular 
interest to the present invention is a scanning laser imager for rapid screening of 
25 fluorescence changes in living cells [(Schroeder K & Neagle B 1996)] currently offered 
commercially by Molecular Devices, Inc. as the FLIPR™. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides an important new dimension in the investigation of cellular 
30 systems involving redistribution in that the invention provides quantification of the 
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redistribution responses or events caused by an influence, typically contact with a 
chemical substance or mixture of chemical substances, but also changes in the physical 
environment. The quantification makes it possible to set up meaningful relationships, 
expressed numerically, or as curves or graphs, between the influences (or the degree of 
5 influences) on cellular systems and the redistribution response. This is highly 

advantageous because, as has been found, the quantification can be achieved in both a fast 
and reproducible manner, and - what is perhaps even more important - the systems which 
become quantifiable utilising the method of the invention are systems from which 
enormous amounts of new information and insight can be derived. 

10 The present screening assays have the distinct advantage over other screening assays, e.g., 
receptor binding assays, enzymatic assays, and reporter gene assays, in providing a system 
in which biologically active substances with completely novel modes of action, e.g. 
inhibition or promotion of redistribution/translocation of a biologically active polypeptide 
as a way of regulating its action rather than inhibition/activation of enzymatic activity, can 

15 be identified in a way that insures very high selectivity to the particular isoform of the 
biologically active polypeptide and further development of compound selectivity versus 
other isoforms of the same biologically active polypeptide or other components of the 
same signalling pathway. 

In its broadest aspect, the invention relates to an improved method, with higher throughput 
20 compared to previous methods, for extracting quantitative information relating to an 

influence on a cellular response, the method comprising recording variation, caused by the 
influence on mechanically intact living cells, in spatially distributed light emitted from a 
luminophore, the luminophore being present in the cells and being capable of being 
redistributed in a manner which is related with the degree of the influence, and/or of being 
25 modulated by a component which is capable of being redistributed in a manner which is 
related to the degree of the influence, the association resulting in a modulation of the 
luminescence characteristics of the luminophore, detecting and recording the spatially 
distributed light from the luminophore, and processing the recorded variation in the 
spatially distributed light to provide quantitative information correlating the spatial 
30 distribution or change in the spatial distribution to the degree of the influence. In one 

aspect of the present invention the mechanically intact living cell is permeabilised at some 
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time after the influence has begun but during or before the actual experimental recording. 
In another aspect, the present invention relates to an improved method for extracting 
quantitative information relating to an influence on a cellular response, the method 
comprising recording variation, caused by the influence on permeabilised living cells, in 
5 spatially distributed light emitted from a luminophore, the luminophore being present in 
the cells and being capable of being redistributed in a manner which is related with the 
degree of the influence, and/or of being modulated by a component which is capable of 
being redistributed in a manner which is related to the degree of the influence, the 
association resulting in a modulation of the luminescence characteristics of the 

10 luminophore, detecting and recording the spatially distributed light from the luminophore, 
and processing the recorded variation in the spatially distributed light to provide 
quantitative information con-elating the spatial distribution or change in the spatial 
distribution to the degree of the influence. In a preferred embodiment of the invention the 
luminophore, which is present in the cells, is capable of being redistributed by modulation 

15 of an intracellular pathway, in a manner which is related to the redistribution of at least 
one component of the intracellular pathway. In another preferred embodiment of the 
invention, the luminophore is a fluorophore. 

In the invention the cell and/or cells are mechanically intact and alive throughout the 
experiment. In another embodiment of the invention, the cells are fixed at a point in time 

20 after the application of the influence at which the response has been predetermined to be 
significant, and the recording is made at an arbitrary later time. In another embodiment the 
cell and/or cells are mechanically intact and alive throughout the experiment but are 
mechanically or chemically disrupted or permeabilised as the initial step of experimental 
analysis. In another aspect of the invention the cells have their plasma membrane 

25 permanently and stably permeabilised before the initiation of the experiment in such a way 
that the plasma membrane stays permeable during the experiment. This allows the 
components of intracellular pathways to be contacted by substances that are not normally 
permeating the cell plasma membrane such as peptides, proteins and hydrophilic organic 
compounds. 

30 The mechanically intact or permeabilised living cells could be selected from the group 
consisting of fungal cells, such as yeast cells; invertebrate cells including insect cells; and 
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vertebrate cells, such as mammalian cells. These cells are incubated at a temperature of 
30°C or above, preferably at a temperature of from 32°C to 39°C, more preferably at a 
temperature of from 35°C to 38°C, and most preferably at a temperature of about 37°C 
during the time period over which the influence is observed. In one aspect of the invention 
5 the mechanically intact or permeabilised living cell is part of a matrix of identical or non- 
identical cells. In one embodiment of the invention the cells comprise a group or groups of 
cells contained within a spatial limitation or spatial limitations. In one embodiment, the 
cells comprise multiple groups of cells that are qualitatively the same but subjected to 
different influences. In another embodiment, the cells comprise multiple groups of cells 
10 that are qualitatively different but subjected to the same influence. 

In one embodiment of the invention the spatial limitations are domains defined on a 
substrate on which the cells are present. The spatial limitations may be arranged in one or 
more arrays on a common carrier. The spatial limitations may be wells in a plate of 
microtiter type, such that 96, 384, 864 and 1536 wells are situated on the common carrier. 

15 In another embodiment the spatial limitations are wells in a plate of a format different 

from the microtiter type. In one embodiment of the invention the domains are established 
by the presence of the cells on the substrate in a pattern that defines the domains. In 
another aspect of the invention, the domains are instead established by the spatial pattern 
or array of the influence or influences as it/they are applied to or contacted by the cells. 

20 This aspect is thoroughly described in Appendix I. Briefly, in this aspect of the invention 
the mechanically intact or permeabilised living cells are part of a continuous or 
discontinuous sheet of cells cultured on an optically clear flat surface optimised or not for 
cell culture. The optically clear and flat surface may be a porous membrane that may allow 
cellular processes to grow through the membrane pores and may allow directed capillary 

25 flow of fluid through the pores. 

A cell used in the present invention should contain a nucleic acid construct encoding a 
fusion polypeptide as defined herein and be capable of expressing the sequence encoded 
by the construct. The cell is a eukaryotic cell selected from the group consisting of fungal 
cells, such as yeast cells; invertebrate cells including insect cells; vertebrate cells such as 
30 mammalian cells. The preferred cells are mammalian cells. 
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In another aspect of the invention the cells could be from an organism carrying in at least 
one of its component cells a nucleic acid sequence encoding a fusion polypeptide as 
defined herein and be capable of expressing said nucleic acid sequence. The organism is 
selected from the group consisting of unicellular and multicellular organisms, such as a 
5 mammal. 

The luminophore is the component that allows the redistribution to be visualised and/or 
recorded by emitting light in a spatial distribution related to the degree of influence. The 
term redistribution is intended to cover all aspects of a change in spatial location, such as a 
translocation of the luminophore or other components. In one embodiment of the 

10 invention, the luminophore is capable of being redistributed in a manner that is 

physiologically relevant to the degree of the influence. It should be understood that 
redistribution. In another embodiment, the luminophore is capable of associating with a 
component that is capable of being redistributed in a manner that is physiologically 
relevant to the degree of the influence. In another embodiment, a correlation between the 

15 redistribution of the luminophore and the degree of the influence could be determined 

experimentally. In a preferred aspect of the invention, the luminophore is capable of being 
redistributed in substantially the same manner as the at least one component of an 
intracellular pathway. In another embodiment of the invention, the luminophore is capable 
of being quenched upon spatial association with a component that is redistributed by 

20 modulation of the pathway, the quenching being measured as a change in the intensity of 
the luminescence. In another embodiment of the invention, the luminophore is stationary 
but may have a certain spatial distribution, and interacts with at least one component that is 
capable of being redistributed in a manner which is physiologically relevant to the degree 
of the influence, in such a way that one or more luminescence characteristics of the 

25 luminophore is/are modulated as the component moves closer to, or farther from, the 
luminophore. 

The luminophore could be a fluorophore. In a preferred embodiment of the invention, the 
luminophore is a polypeptide encoded by and expressed from a nucleotide sequence 
harboured in the cells. The luminophore could be a hybrid polypeptide comprising a fusion 
30 of at least a portion of each of two polypeptides one of which comprises a luminescent 
polypeptide and the other one of which comprises a biologically active polypeptide, as 
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defined herein. 

The luminescent polypeptide could be a GFP as defined herein or could be selected from 
the group consisting of green fluorescent proteins having the F64L mutation as defined 
herein such as F64L-GFP, F64L-Y66H-GFP, F64L-S65T-GFP, and EGFP. The GFP could 
5 be N- or C-terminally tagged, optionally via a peptide linker, to the biologically active 
polypeptide or a part or a subunit thereof. The fluorescent probe could be a component of 
an intracellular signalling pathway. The probe is coded for by a nucleic acid construct. 

The pathway of investigation in the present invention could be an intracellular signalling 
pathway. 

10 In a preferred embodiment of the invention, the influence could be contact between the 
group or groups of mechanically intact or permeabilised living cells and a chemical 
substance, and/or incubation of the group or groups of mechanically intact or 
permeabilised living cells with a chemical substance in solution. In one aspect of the 
invention that is thoroughly described in Appendix I, the chemical substances are attached 

15 to an underlying matrix. In this aspect, the chemical substances may also be produced and 
secreted from, or attached to the plasma membrane surfaces of, a sheet of genetically 
engineered cells. In this aspect of the invention the chemical substances may also have 
been separated two-dimensionally in a non-denaturing gel using electrophoresis and the 
gel is directly put in close proximity or direct contact with the mechanically intact or 

20 permeabilised living cells so that the chemical substances can contact the cells through 
diffusion or convection. 

The influence will modulate the intracellular processes. In one aspect the modulation could 
be an activation of the intracellular processes. In another aspect the modulation could be a 
deactivation of the intracellular processes. In yet another aspect, the influence could inhibit 
25 or promote the redistribution without directly affecting the metabolic activity of the 
component of the intracellular processes. 

In one embodiment the invention is used to establish a dose-response relationship for one 
or many chemical substances. In one embodiment the invention is used as a basis for a 
screening program, where the effect of unknown influences such as a compound library, 
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can be compared to influence of known reference compounds under standardised 
conditions. 

In addition to the intensity, there are several parameters of fluorescence or luminescence 
that can be modulated by the effect of the influence on the underlying cellular phenomena, 
5 and can therefore be used in the invention. Some examples are resonance energy transfer, 
fluorescence lifetime, polarisation, and wavelength shift. Each of these methods requires a 
particular kind of filter in the emission light path to select the component of the light 
desired and reject other components. The recording of property of light could be in the 
form of an ordered array of values such as a CCD array or a vacuum tube device such as a 
10 vidicon. In addition, the translational mobility, or freedom of movement, of the 

luminophore attached to the protein of interest can be an important property affected by 
the influence on the underlying cellular phenomena, and can therefore be used in he 
invention. 

In one embodiment of the invention, the spatially distributed light emitted by a 
15 luminophore is detected by a change in the resonance energy transfer between the 
luminophore and another luminescent entity capable of delivering energy to the 
luminophore, each of which has been selected or engineered to become part of, bound to or 
associated with particular components of the intracellular pathway. In this embodiment, 
either the luminophore or the luminescent entity capable of delivering energy to the 
20 luminophore undergoes redistribution in response to an influence. The resonance energy 
transfer would be measured as a change in the intensity of emission from the luminophore, 
preferably sensed by a single channel photodetector that responds only to the average 
intensity of the luminophore in a non-spatially resolved fashion. 

In one embodiment of the invention, the spatially distributed light emitted by a 
25 luminophore includes the case of uniform spatial distribution of the light. 

In one aspect of the invention, the luminophore is a fluorophore which redistributes 
through a non-homogenous excitation light field, resulting in a change in the intensity of 
the light emitted from the luminophore as a result of the change in the amount of excitation 
light intensity at different points in the field. 
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In one embodiment of the invention, the recording of the spatially distributed light could 
be made at a single point in time after the application of the influence. In another 
embodiment, the recording could be made at two points in time, one point being before, 
and the other point being after the application of the influence. The result or variation is 

5 determined from the change in fluorescence compared to the fluorescence measured prior 
to the influence or modulation. In another embodiment of the invention, the recording 
could be performed at a series of points in time, in which the application of the influence 
occurs at some time after the first time point in the series of recordings, the recording 
being performed, e.g., with a predetermined time spacing of from 0.1 seconds to 1 hour, 

10 preferably from 1 to 60 seconds, more preferably from 1 to 30 seconds, in particular from 
1 to 10 seconds, over a time span of from 1 second to 12 hours, such as from 10 seconds to 
12 hours, e.g., from 10 seconds to one hour, such as from 60 seconds to 30 minutes or 20 
minutes. The result or variation is determined from the change in fluorescence over time. 
The result or variation could also be determined as a change in the spatial distribution of 

15 the fluorescence over time. 

In one embodiment the recording comprises a time series of total luminescence of the cells 
of one or several of the spatial limitations. In one embodiment the signal from all of the 
spatial limitations, one at a time, is measured by a recording being made in the individual 
spatial limitations by means of an apparatus to sequentially position each one of the 

20 limitations in the field of view of the detector and repeating the positioning and 

measurement process until all of the spatial limitations have been measured. The detector 
may be a photomultiplier tube. In a preferred embodiment of the present invention more 
than one spatial limitation is measured simultaneously. This may be done by means of a 
one- or two-dimensional array detector, whereby the multiple spatial limitations are 

25 imaged onto the array detector such that discrete subsets of the detecting units (pixels) in 
the array detector measure the signal from one and only one of the multiple spatial 
limitations, the signal from any one spatial limitation being the combined signal from 
those pixels that receive the image from one of the spatial limitations. This array detector 
may be a linear diode array, a video camera (according to any present or future standards 

30 and definitions of image acquisition and transmission) or a charge transfer device such as a 
charge-coupled device (CCD). In one embodiment the recording of signal requires 
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illumination of the multiple spatial limitations to excite the luminophores so that they emit 
light. In one embodiment all of the spatial limitations are simultaneously illuminated 
during the measurement. In another embodiment the spatial limitations are singly 
illuminated only during the time in which they are being measured. In a preferred 
5 embodiment the illumination is provided by a laser that is scanned in a raster fashion over 
some or all of the spatial limitations being measured. The scanning may take place at a rate 
that is substantially faster than the measurement process such that the illumination appears 
to the measurement process to be continuous in time and spatially uniform over the region 
being measured. 

10 The recording of spatially distributed luminescence emitted from the luminophore is 

performed by an apparatus for measuring the distribution of fluorescence in the cells, and 
thereby any change in the distribution of fluorescence in the cells, which includes at a 
minimum the following component parts: (a) a light source, (b) a method for selecting the 
wavelength(s) of light from the source which will excite the luminescence of the 

15 luminophore, (c) a device which can rapidly block or pass the excitation light into the rest 
of the system, (d) a series of optical elements for conveying the excitation light to the 
specimen, collecting the emitted fluorescence in a spatially resolved fashion, and forming 
an image from this fluorescence emission (or another type of intensity map relevant to the 
method of detection and measurement), (e) a bench or stand which holds the container of 

20 the cells being measured in a predetermined geometry with respect to the series of optical 
elements, (f) a detector to record the spatially resolved fluorescence in the form of an 
image, (g) a computer or electronic system and associated software to acquire and store the 
recorded images, and to compute the degree of redistribution from the recorded images. 

In a preferred embodiment of the invention the apparatus system is automated. In one 
25 embodiment the components in d and e mentioned above comprise a fluorescence 

microscope. In one embodiment the component in f mentioned above is a CCD camera. In 
one embodiment the component in f mentioned above is an array of photomultiplier 
tubes/devices. 

In one embodiment the image is formed and recorded by an optical scanning system. 
30 In one embodiment the optical scanning system is used to illuminate the bottom of a plate 
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of microtiter type so that a time-resolved recording of changes in luminescence or 
fluorescence can be made from all spatial limitations simultaneously. 

In a preferred embodiment the actual luminescence or fluorescence measurements are 
made in a FLIPR™ instrument, commercially available from Molecular Devices, Inc. 

5 In one embodiment of the invention the actual fluorescence measurements are made in a 
standard type of fluorometer for plates of microtiter type (fluorescence plate reader). 

In one embodiment a liquid addition system is used to add a known or unknown compound 
to any or all of the cells in the cell holder at a time determined in advance. Preferably, the 
liquid addition system is under the control of the computer or electronic system. Such an 
10 automated system can be used for a screening program due to its ability to generate results 
from a larger number of test compounds than a human operator could generate using the 
apparatus in a manual fashion. 

The methods whereby the detector layer of cells are physically contacted by the 
compounds can also be of another conceptual type where the compounds are delivered to 
15 the cells through a porous membrane by convection/diffusion or by directly contacting 

compounds attached to an inorganic or organic support (such as glass, plastic or the plasma 
membrane of intact living cells) with the cells. These methods are thoroughly described in 
Appendix I, but are also outlined in the following paragraphs. 

In one aspect of the present invention where the detector layer of cells is a continuous or 
20 discontinuous sheet of cells without any separation into test units or wells. The compounds 
are printed onto a nonabsorbent sheet of porous material as a solution in solvent and 
allowed to dry. This printed sheet of compounds then defines the test pattern for the 
experiment as it is brought down in close proximity to or in direct contact with the 
underlying detector layer of cells. The compounds, now dissolved by the fluid layer on the 
25 cells, is brought in contact with the cells through the pores of the membrane by convection. 
The porous membrane onto which the compounds are printed is optically clear and 
preferably composed as stated in Appendix I. In another embodiment of this aspect of the 
present invention the detector layer of cells is a continuous or discontinuous sheet of cells, 
without any separation into test units or wells, growing on a porous and optically clear 
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membrane preferably of the types mentioned above. The porous membrane may allow the 
cells to send cellular processes through the pores of the membrane. The compounds are 
printed onto an optically clear substratum such as glass, plastic or quartz as solutions in 
solvent and allowed to dry. At the time of the experiment the cell sheet on the membrane, 
5 surrounded by a thin film of fluid, is layered ontop of the printed compound pattern. The 
compounds then dissolve and contact the cells via diffusion and convection. The 
compounds may be made using combinatorial chemistry techniques, and may be peptides. 
The compounds may be covalently attached to the optically clear substratum or porous 
membrane. The compounds may also be proteins, polypeptides or peptides secreted by or 
10 attached to the plasma membrane of genetically modified cells growing as a continuous or 
discontinuous sheet on a flat optically clear surface or an optically clear porous membrane. 

The recording of the variation or result with respect to light emitted from the luminophore 
is performed by recording the spatially distributed light as one or more digital images, and 
the processing of the recorded variation to reduce it to one or more numbers representative 

15 of the degree of redistribution comprises a digital image processing procedure or 

combination of digital image processing procedures. The quantitative information which is 
indicative of the degree of the cellular response to the influence or the result of the 
influence on the intracellular pathway is extracted from the recording or recordings 
according to a predetermined calibration based on responses or results, recorded in the 

20 same manner, to known degrees of a relevant specific influence. This calibration procedure 
is developed according to principles described below (Developing an Image-based Assay 
Technique). Specific descriptions of the procedures for particular assays are given in the 
examples. 

While the stepwise procedure necessary to reduce the image or images to the value 
25 representative of the response caused by the influence is particular to each assay, the 

individual steps are generally well-known methods of image processing. Some examples 
of the individual steps are point operations such as subtraction, ratioing, and thresholding, 
digital filtering methods such as smoothing, sharpening, and edge detection, spatial 
frequency methods such as Fourier filtering, image cross-correlation and image 
30 autocorrelation, object finding and classification (blob analysis), and colour space 
manipulations for visualisation. In addition to the algorithmic procedures, heuristic 
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methods such as neural networks may also be used. In a preferred embodiment of the 
invention, a dose-response relationship is established based on quantification of the 
responses caused by a particular influence, representative of the underlying intracellular 
signalling process, using the methods described above and in examples 1-22 and 25. The 
5 dose-response relationship for the particular influence is then compared to the dose- 
response relationship obtained by performing the same assay in an instrument which 
allows parallel monitoring of all wells in a microtiter plate such as a FLIPR™ or an 
ordinary fluorescence plate reader for microtiter plates. If a good correlation between the 
dose-response relationships obtained from the two different measurement systems is 
10 obtained, it can be said that the parallel measurement mode has been validated (see 

examples 23 and 24). This implies that it can be used as the primary basis for a screening 
assay with the potential benefit of screening a significantly higher number of substances 
per unit of time for their influence on the response. 

Imaging plate readers integrate the signal from each well into a single value per time point. 

15 Thus the data resulting from a single "run" of the instrument is a set of time series of 

single values, one for each well, with the injection of the test compound taking place at a 
known point in the time series. The primary advantage of this type of instrumentation is 
that it greatly increases the number of samples that can be processed in a given amount of 
time (the throughput). This is of great advantage when using the assay in a screening 

20 program for new pharmaceutical lead compounds. 

The first step in the data analysis is to normalise the results from each well so that they can 
be compared with each other or with previously analysed known compounds. This always 
begins with correcting the signal by subtracting the instrument bias from all data points on 
a well-by-well basis. From this point, either of two techniques can be followed depending 
25 on the design of the assay: 

Procedure 1 : The average of the signal prior to the addition of the test compound is 
subtracted from all data points on a well-by-well basis. 

Procedure 2: The data are corrected for any known background by subtracting the 
background value from all data points on a well-by-well basis. The resulting background- 
30 corrected data are normalised by dividing each data set by the average of the data values 
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prior to the injection of the test compound on a well-by-well basis. 

The corrected or normalised time series data sets are then further reduced by a technique 
that converts the time series to a single value. There are at least three such approaches: 

1 . For transient responses, the maximum deviation from the baseline is determined. This 
5 is also known as the "peak height" technique. 

2. Alternatively, the signal is integrated over time between pre-defined limits. If the data 
were treated according to Procedure 2 above, then the offset is subtracted such that the 
integral of a non-response is zero within the limit of measurement error. This is also 
known as the "peak area" technique. 

10 3. If the response is a cumulative one, e.g., an exponential change to a new level, the 
result is taken as the either the difference or the ratio between the signal after a 
predetermined time and the signal prior to the addition of the test compound. 

All of the above procedures reduce the data for a given well to one or more single values. 
For screening purposes, these values will be searched for those that are greater than a 
15 certain statistically determined cut-off value. For characterisation, the values represent a 
quantitative response, and are further treated in sets by techniques such as dose-response 
curve fitting. 

In another embodiment of the invention, the measurement of redistribution is 
accomplished indirectly by taking advantage of the fact that in order for redistribution to 

20 occur, the probe will experience some change in its freedom, or restriction, of movement 
within the intracellular milieu. The degree of translocation will correlate with the amount 
of freely mobile luminophore in the cytoplasm. At a point in time after the test compound 
has begun to have any influence it may have, the amount or fraction of restricted 
luminophore can be measured by disrupting or permeabilising the plasma membrane of the 

25 cells and allowing the freely mobile luminophore to diffuse away. If the detection volume 
of the detector is limited to the region immediately surrounding the cells, and the overall 
volume into which the freely mobile luminophore can diffuse is much larger, then the 
freely mobile luminophore essentially disappears from the detector's view and its signal is 
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not recorded. 

In one aspect of the invention, the above mentioned measurement of redistribution is made 
on cells with permanently permeabilised plasma membranes immersed in a solution 
mimicking the cytoplasmic environment. In this way the influence of compounds that can 
5 normally not enter the cytoplasm of cells can be tested. 

The nucleic acid constructs used in the present invention encode in their nucleic acid 
sequences fusion polypeptides comprising a biologically active polypeptide that is a 
component of an intracellular signalling pathway, or a part thereof, and a GFP, preferably 
an F64L mutant of GFP, N- or C-terminally fused, optionally via a peptide linker, to the 
10 biologically active polypeptide or part thereof. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid 
construct is a protein kinase or a phosphatase. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid 
construct is a transcription factor or a part thereof which changes cellular localisation upon 
15 activation. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid 
construct is a protein, or a part thereof, which is associated with the cytoskeletal network 
and which changes cellular localisation upon activation. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid 
20 construct is a protein kinase or a part thereof which changes cellular localisation upon 
activation. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid 
construct is a serine/threonine protein kinase or a part thereof capable of changing 
intracellular localisation upon activation. 

25 In one embodiment the biologically active polypeptide encoded by the nucleic acid 

construct is a tyrosine protein kinase or a part thereof capable of changing intracellular 
localisation upon activation. 
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In one embodiment the biologically active polypeptide encoded by the nucleic acid 
construct is a phospholipid-dependent serine/threonine protein kinase or a part thereof 
capable of changing intracellular localisation upon activation. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid 
5 construct is a cAMP-dependent protein kinase or a part thereof capable of changing 
cellular localisation upon activation. In a preferred embodiment the biologically active 
polypeptide encoded by the nucleic acid construct is a PKAc-F64L-S65T-GFP fusion. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid 
construct is a cGMP-dependent protein kinase or a part thereof capable of changing 
10 cellular localisation upon activation. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid 
construct is a calmodulin-dependent serine/threonine protein kinase or a part thereof 
capable of changing cellular localisation upon activation. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid 
15 construct is a mitogen-activated serine/threonine protein kinase or a part thereof capable of 
changing cellular localisation upon activation. In preferred embodiments the biologically 
active polypeptide encoded by the nucleic acid constructs are an ERK1-F64L-S65T-GFP 
fusion or an EGFP-ERK1 fusion. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid 
20 construct is a cyclin-dependent serine/threonine protein kinase or a part thereof capable of 
changing cellular localisation upon activation. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid 
construct is a protein phosphatase or a part thereof capable of changing cellular 
localisation upon activation. 

25 In one preferred embodiment of the invention the nucleic acid constructs may be DNA 
constructs. 

In one embodiment the biologically active polypeptide encoded by the nucleic acid 
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construct. In one embodiment the gene encoding GFP in the nucleic acid construct is 
derived from Aequorea victoria. In a preferred embodiment the gene encoding GFP in the 
nucleic acid construct is EGFP or a GFP variant selected from F64L-GFP, F64L-Y66H- 
GFP and F64L-S65T-GFP. 

5 In preferred embodiments of the invention the DNA constructs which can be identified by 
any of the DNA sequences shown in SEQ ID NO: 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 
60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 108, 1 10, 1 12, 1 14, 1 16, 118, 120, 122, 124, 126, 
128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, and 152 or are variants of these 
sequences capable of encoding the same fusion polypeptide or a fusion polypeptide which 

10 is biologically equivalent thereto, e.g. an isoform, or a splice variant or a homologue from 
another species. 

The present invention describes a method that may be used to establish a screening 
program for the identification of biologically active substances that directly or indirectly 
affects intracellular signalling pathways and because of this property are potentially useful 
15 as medicaments. Based on measurements in living cells of the redistribution of spatially 
resolved luminescence from luminophores which undergo a change in distribution upon 
activation or deactivation of an intracellular signalling pathway the result of the individual 
measurement of each substance being screened indicates its potential biological activity. 

In one embodiment of the invention the screening program is used for the identification of 
20 a biologically toxic substance as defined herein that exerts its toxic effect by interfering 
with an intracellular signalling pathway. Based on measurements in living cells of the 
redistribution of spatially resolved luminescence from luminophores which undergo a 
change in distribution upon activation or deactivation of an intracellular signalling 
pathway the result of the individual measurement of each substance being screened 
25 indicates its potential biologically toxic activity. In one embodiment of a screening 

program a compound that modulates a component of an intracellular pathway as defined 
herein, can be found and the therapeutic amount of the compound estimated by a method 
according to the method of the invention. In a preferred embodiment the present invention 
leads to the discovery of a new way of treating a condition or disease related to the 
30 intracellular function of a biologically active polypeptide comprising administration to a 
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patient suffering from said condition or disease of an effective amount of a compound 
which has been discovered by any method according to the invention. In another preferred 
embodiment of the invention a method is established for identification of a new drug target 
or several new drug targets among the group of biologically active polypeptides which are 
5 components of intracellular signalling pathways. 

In another embodiment of the invention an individual treatment regimen is established for 
the selective treatment of a selected patient suffering from an ailment where the available 
medicaments used for treatment of the ailment are tested on a relevant primary cell or cells 
obtained from said patient from one or several tissues, using a method comprising 

10 transfecting the cell or cells with at least one DNA sequence encoding a fluorescent probe 
according to the invention, transferring the transfected cell or cells back the said patient, or 
culturing the cell or cells under conditions permitting the expression of said probes and 
exposing it to an array of the available medicaments, then comparing changes in 
fluorescence patterns or redistribution patterns of the fluorescent probes in the intact living 

15 cells to detect the cellular response to the specific medicaments (obtaining a cellular action 
profile), then selecting one or more medicament or medicaments based on the desired 
activity and acceptable level of side effects and administering an effective amount of these 
medicaments to the selected patient. 

The present invention describes a method that may be used to establish a screening 
20 program for back-tracking signal transduction pathways as defined herein. In one 

embodiment the screening program is used to establish more precisely at which level one 
or several compounds affect a specific signal transduction pathway by successively or in 
parallel testing the influence of the compound or compounds on the redistribution of 
spatially resolved luminescence from several of the luminophores which undergo a change 
25 in distribution upon activation or deactivation of the intracellular signalling pathway under 
study. 

In general, a probe, i.e. a "GeneX"-GFP fusion or a GFP-"GeneX" fusion, is constructed 
using PCR with "GeneX"-specific primers followed by a cloning step to fuse "GeneX" in 
frame with GFP. The fusion may contain a short vector derived sequence between 
30 "GeneX" and GFP (e.g. part of a multiple cloning site region in the plasmid) resulting in a 
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peptide linker between "GeneX" and GFP in the resulting fusion protein. 

Some of the steps involved in the development of a probe include the following: 

Identify the sequence of the gene. This is most readily done by searching a depository 
of genetic information, e.g. the GenBank Sequence Database, which is widely 
5 available and routinely used by molecular biologists. In the specific examples below 

the GenBank Accession number of the gene in question is provided. 

Design the gene-specific primers. Inspection of the sequence of the gene allows design 
of gene-specific primers to be used in a PCR reaction. Typically, the top-strand primer 
encompasses the ATG start codon of the gene and the following ca. 20 nucleotides, 

10 while the bottom-strand primer encompasses the stop codon and the ca. 20 preceding 

nucleotides, if the gene is to be fused behind GFP, i.e. a GFP-"GeneX" fusion. If the 
gene is to be fused in front of GFP, i.e. a "GeneX"-GFP fusion, a stop codon must be 
avoided. Optionally, the full-length sequence of GeneX may not be used in the fusion, 
but merely the part that localizes and redistributes like GeneX in response to a signal. 

15 In addition to gene-specific sequences, the primers contain at least one recognition 

sequence for a restriction enzyme, to allow subsequent cloning of the PCR product. 
The sites are chosen so that they are unique in the PCR product and compatible with 
sites in the cloning vector. Furthermore, it may be necessary to include an exact 
number of nucleotides between the restriction enzyme site and the gene-specific 

20 sequence in order to establish the correct reading frame of the fusion gene and/or a 

translation initiation consensus sequence. Lastly, the primers always contain a few 
nucleotides in front of the restriction enzyme site to allow efficient digestion with the 
enzyme. 

Identify a source of the gene to be amplified. In order for a PCR reaction to produce a 
25 product with gene-specific primers, the gene-sequence must initially be present in the 

reaction, e.g. in the form of cDNA. Information in GenBank or the scientific literature 
will usually indicate in which tissue(s) the gene is expressed, and cDNA libraries from 
a great variety of tissues or cell types from various species are commercially available, 
e.g. from Clontech (Palo Alto), Stratagene (La Jolla) and Invitrogen (San Diego). 
30 Many genes are also available in cloned form from The American Type Tissue 
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Collection (Virginia). 

Optimise the PCR reaction. Several factors are known to influence the efficiency and 
specificity of a PCR reaction, including the annealing temperature of the primers, the 
concentration of ions, notably Mg 2+ and K\ present in the reaction, as well as pH of the 
5 reaction. If the result of a PCR reaction is deemed unsatisfactory, it might be because 

the parameters mentioned above are not optimal. Various annealing temperatures 
should be tested, e.g. in a PCR machine with a built-in temperature gradient, available 
from e.g. Stratagene (La Jolla), and/or various buffer compositions should be tried, e.g. 
the OptiPrime buffer system from Stratagene (La Jolla). 

10 - Clone the PCR product. The vector into which the amplified gene product will be 

cloned and fused with GFP will already have been taken into consideration when the 
primers were designed. When choosing a vector, one should at least consider in which 
cell types the probe subsequently will be expressed, so that the promoter controlling 
expression of the probe is compatible with the cells. Most expression vectors also 

15 contain one or more selective markers, e.g. conferring resistance to a drug, which is a 

useful feature when one wants to make stable transfectants. The selective marker 
should also be compatible with the cells to be used. 

The actual cloning of the PCR product should present no difficulty as it typically will be a 
one-step cloning of a fragment digested with two different restriction enzymes into a 

20 vector digested with the same two enzymes. If the cloning proves to be problematic, it may 
be because the restriction enzymes did not work well with the PCR fragment. In this case 
one could add longer extensions to the end of the primers to overcome a possible difficulty 
of digestion close to a fragment end, or one could introduce an intermediate cloning step 
not based on restriction enzyme digestion. Several companies offer systems for this 

25 approach, e.g. Invitrogen (San Diego) and Clontech (Palo Alto). 

Once the gene has been cloned and, in the process, fused with the GFP gene, the resulting 
product, usually a plasmid, should be carefully checked to make sure it is as expected. The 
most exact test would be to obtain the nucleotide sequence of the fusion-gene. 

Once a DNA construct for a probe has been generated, its functionality and usefulness may 
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be evaluated by transfecting it into cells capable of expressing the probe. The fluorescence 
of the cell is inspected soon after, typically the next day. At this point, two features of 
cellular fluorescence are noted: the intensity and the sub-cellular localisation. 

The intensity should usually be at least as strong as that of unfused GFP in the cells. If it is 
5 not, the sequence or quality of the probe-DNA might be faulty, and should be carefully 
checked. 

The sub-cellular localisation is an indication of whether the probe is likely to perform well. 
If it localises as expected for the gene in question, e.g. is excluded from the nucleus, it can 
immediately go on to a functional test. If the probe is not localised soon after the 

10 transfection procedure, it may be because of overexpression at this point in time, as the 
cell typically will have taken up very many copies of the plasmid, and localisation will 
occur in time, e.g. within a few weeks, as plasmid copy number and expression level 
decreases. If localisation does not occur after prolonged time, it may be because the fusion 
to GFP has destroyed a localisation function, e.g. masked a protein sequence essential for 

15 interaction with its normal cellular anchor-protein. In this case the opposite fusion might 
work, e.g. if GeneX-GFP does not work, GFP-GeneX might, as two different parts of 
GeneX will be affected by the proximity to GFP. If this does not work, the proximity of 
GFP at either end might be a problem, and it could be attempted to increase the distance by 
incorporating a longer linker between GeneX and GFP in the DNA construct. 

20 If there is no prior knowledge of localisation, and no localisation is observed, it may be 
because the probe should not be localised at this point, because such is the nature of the 
protein fused to GFP. It should then be subjected to a functional test. 

In a functional test, the cells expressing the probe are treated with at least one compound 
known to perturb, usually by activating, the signalling pathway on which the probe is 
25 expected to report by redistributing itself within the cell. If the redistribution is as 

expected, e.g. if prior knowledge tell that it should translocate from location X to location 
Y, it has passed the first critical test. In this case it can go on to further characterisation and 
quantification of the response. 

If it does not perform as expected, it may be because the cell lacks at least one component 
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of the signalling pathway, e.g. a cell surface receptor, or there is species incompatibility, 
e.g. if the probe is modelled on sequence information of a human gene product, and the 
cell is of hamster origin. In both instances one should identify other cell types for the 
testing process where these potential problems would not apply. 

5 If there is no prior knowledge about the pattern of redistribution, the analysis of the 
redistribution will have to be done in greater depth to identify what the essential and 
indicative features are, and when this is clear, it can go on to further characterisation and 
quantification of the response. If no feature of redistribution can be identified, the problem 
might be as mentioned above, and the probe should be retested under more optimal cellular 
10 conditions. 

If the probe does not perform under optimal cellular conditions, then it's back to the 
drawing board. 

The process of developing an image-based redistribution assay begins with either the 
unplanned experimental observation that a redistribution phenomenon can be visualised, or 

15 the design of a probe specifically to follow a redistribution phenomenon already known to 
occur. In either event, the first and best exploratory technique is for a trained scientist or 
technician to observe the phenomenon. Even with the rapid advances in computing 
technology, the human eye-brain combination is still the most powerful pattern recognition 
system known, and requires no advance knowledge of the system in order to detect 

20 potentially interesting and useful patterns in raw data. This is especially if those data are 
presented in the form of images, which are the natural "data type" for human visual 
processing. Because human visual processing operates most effectively in a relatively 
narrow frequency range, i.e., we cannot see either very fast or very slow changes in our 
visual field, it may be necessary to record the data and play it back with either time 

25 dilation or time compression. 

Some luminescence phenomena cannot be seen directly by the human eye. Examples 
include polarisation and fluorescence lifetime. However, with suitable filters or detectors, 
these signals can be recorded as images or sequences of images and displayed to the 
human in the fashion just described. In this way, patterns can be detected and the same 
30 methods can be applied. 
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Once the redistribution has been determined to be a reproducible phenomenon, one or 
more data sets are generated for the purpose of developing a procedure for extracting the 
quantitative information from the data. In parallel, the biological and optical conditions are 
determined which will give the best quality raw data for the assay. This can become an 
iterative process; it may be necessary to develop a quantitative procedure in order to assess 
the effect on the assay of manipulating the assay conditions. 

The data sets are examined by a person or persons with knowledge of the biological 
phenomenon and skill in the application of image processing techniques. The goal of this 
exercise is to determine or at least propose a method that will reduce the image or 
sequence of images constituting the record of a "response" to a value corresponding to the 
degree of the response. Using either interactive image processing software or an image 
processing toolbox and a programming language, the method is encoded as a procedure or 
algorithm that takes the image or images as input and generates the degree of response (in 
any units) as its output. Some of the criteria for evaluating the validity of a particular 
procedure are: 

• Does the degree of the response vary in a biologically significant fashion, i.e., does 
it show the known or putative dependence on the concentration of the stimulating 
agent or condition? 

• Is the degree of response reproducible, i.e., does the same concentration or level of 
stimulating agent or condition give the same response with an acceptable variance? 

• Is the dynamic range of the response sufficient for the purpose of the assay? If not, 
can a change in the procedure or one of its parameters improve the dynamic range? 

• Does the procedure exhibit any clear "pathologies", i.e., does it give ridiculous 
values for the response if there are commonly occurring imperfections in the 
imaging process? Can these pathologies be eliminated, controlled, or accounted 
for? 

• Can the procedure deal with the normal variation in the number and/or size of cells 
in an image? 
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In some cases the method may be obvious; in others, a number of possible procedures may 
suggest themselves. Even if one method appears clearly superior to others, optimisation of 
parameters may be required. The various procedures are applied to the data set and the 
criteria suggested above are determined, or the single procedure is applied repeatedly with 
5 adjustment of the parameter or parameters until the most satisfactory combination of 

signal, noise, range, etc. are arrived at. This is equivalent to the calibration of any type of 
single-channel sensor. 

The number of ways of extracting a single value from an image are extremely large, and 
thus an intelligent approach must be taken to the initial step of reducing this number to a 
10 small, finite number of possible procedures. This is not to say that the procedure arrived at 
is necessarily the best procedure - but a global search for the best procedure is simply out 
of the question due to the sheer number of possibilities involved. 

Image-based assays are no different than other assay techniques in that their usefulness is 
characterised by parameters such as the specificity for the desired component of the 
15 sample, the dynamic range, the variance, the sensitivity, the concentration range over 
which the assay will work, and other such parameters. While it is not necessary to 
characterise each and every one of these before using the assay, they represent the only 
way to compare one assay with another. 

The final step is then to see whether there exists a possibility to increase the throughput of 
20 the assay to improve its utility as the basis of a screening program. In order to do this, a 
dose-response relationship is established based on quantification of the responses caused 
by a particular influence, representative of the underlying intracellular signalling process, 
using the methods described above and in examples 1-22 and 25. The dose-response 
relationship for the particular influence is then compared to the dose-response relationship 
25 obtained by performing the same assay in an instrument which allows parallel monitoring 
of all wells in a microtiter plate such as a FLIPR™ or an ordinary imaging or fluorescence 
plate reader for microtiter plates. If a good correlation between the dose-response 
relationships obtained from the two different measurement systems is obtained, it can be 
said that the parallel measurement mode has been validated (see examples 23 and 24). This 
30 implies that it can be used as the primary basis for a screening program with the potential 
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benefit of screening a significantly higher number of substances for their influence on the 
response per unit of time. 

The process of developing an image-based assay is best illustrated by example. The 
development of such an assay for GLUT4 translocation is hereby described. GLUT4 is a 
5 member of the class of glucose transporter molecules that are important in cellular glucose 
uptake. It is known to translocate to the plasma membrane under some conditions of 
stimulation of glucose uptake. The ability to visualise the glucose uptake response non- 
invasively, without actually measuring glucose uptake, would be a very useful assay for 
anyone looking for, for example, treatments for type II diabetes. 

10 A CHO cell line which stably expressed the human insulin receptor was used as the basis 
for a new cell line which stably expressed a fusion between GLUT4 and GFP. This cell 
line was expected to show translocation of GLUT4 to the plasma membrane as visualised 
by the movement of the GFP. The translocation could definitely be seen in the form of the 
appearance of local increases in the fluorescence in regions of the plasma membrane which 

15 had a characteristic shape or pattern. This is shown in Figure 12. 

These objects became known as "snircles", and the phenomenon of their appearance as 
"snircling" In order to quantify their appearance, a method had to be found to isolate them 
as objects in the image field, and then enumerate them, measure their area, or determine 
some parameter about them which correlated in a dose-dependent fashion with the 

20 concentration of insulin to which the cells had been exposed. In order to separate the 

snircles, a binarization procedure was applied in which one copy of the image smoothed 
with a relatively severe gaussian kernel (sigma = 2.5) was subtracted from another copy to 
which only a relatively light gaussian smooth had been applied (sigma=0.5). The resultant 
image was rescaled to its min/max range, and an automatic threshold was applied to divide 

25 the image into two levels. The thresholded image contains a background of one value all 
found object with another value. The found objects were first filtered through a filter to 
remove objects far too large and far too small to be snircles. The remaining objects, which 
represent snircles and other artifacts from the image with approximately the same size and 
intensity characteristics as snircles, are passed into a classification procedure which has 

30 been previously trained with many images of snircles to recognize snircles and exclude the 
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other artifacts. The result of this procedure is a binary image that shows only the found 
snircles to the degree to which the classification procedure can accurately identify them. 
The total area of the snircles is then summed and this value is the quantitative measure of 
the degree of snircling for that image. 

Another approach to the problem of quantifying GLUT 4 translocation has been performed 
and validated using the same type of experimental protocol but a different image 
processing approach. In this case the objects of interest in the cells are not the appearance 
of snircles at the plasma membrane but the disappearance of GLUT4-GFP fluorescence 
from its intracellular site. With this method the bright area, consisting of GLUT4-GFP, 
centrally located in each cell is identified by a thresholding procedure. This demarcates a 
certain area for the centrally located GLUT4-GFP. In the next step the total fluorescence 
intensity in this area is quantified on each image in the image series, i.e. over time. The 
response for each cell is defined as the difference in fluorescence intensity in the centrally 
located GLUT4-GFP area before and a fixed point in time after application of the 
influence. The dose-response relationship for insulin using the above described 
quantitation procedure is shown in Figure 13. It can be seen that the ED50 value for insulin 
to reduce central GLUT4-GFP fluorescence is 0.3 nM. 

In the present specification and claims, the term "an influence" covers any influence to 
which the cellular response comprises a redistribution. Thus, e.g., heating, cooling, high 
pressure, low pressure, humidifying, or drying are influences on the cellular response on 
which the resulting redistribution can be quantified, but as mentioned above, perhaps the 
most important influences are the influences of contacting or incubating the cells with 
substances which are known or suspected to exert an influence on the cellular response 
involving a redistribution contribution. In another embodiment of the invention the 
influence could be substances from a compound drug library. 

In the present context, the term "green fluorescent protein" is intended to indicate a protein 
which, when expressed by a cell, emits fluorescence upon exposure to light of the correct 
excitation wavelength (cf. [(Chalfie, M. el al (1994) Science 263, 802-805)]). In the 
following, GFP in which one or more amino acids have been substituted, inserted or 
deleted is most often termed "modified GFP". "GFP" as used herein includes wild-type 
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GFP derived from the jelly fish Aequorea victoria and modifications of GFP, such as the 
blue fluorescent variant of GFP disclosed by Heim et al (1994). Proc.Natl.Acad.Sci. 91:26, pp 
12501-12504, and other modifications that change the spectral properties of the GFP 
fluorescence, or modifications that exhibit increased fluorescence when expressed in cells 
5 at a temperature above about 30°C described in PCT/DK96/00051, published as WO 

97/1 1094 on 27 March 1997 and hereby incorporated by reference, and which comprises a 
fluorescent protein derived from Aequorea Green Fluorescent Protein (GFP) or any functional 
analogue thereof, wherein the amino acid in position 1 upstream from the chromophore has 
been mutated to provide an increase of fluorescence intensity when the fluorescent protein of 
10 the invention is expressed in cells. Preferred GFP variants are F64L-GFP, F64L-Y66H-GFP 
and F64L-S65T-GFP. An especially preferred variant of GFP for use in all the aspects of 
this invention is EGFP (DNA encoding EGFP which is a F64L-S65T variant with codons 
optimized for expression in mammalian cells is available from Clontech, Palo Alto, 
plasmids containing the EGFP DNA sequence, cf. GenBank Acc. Nos. U55762, U55763). 

15 The term "intracellular signalling pathway" and "signal transduction pathway" are 
intended to indicate the co-ordinated intracellular processes whereby a living cell 
transduce an external or internal signal into cellular responses. Said signal transduction 
will involve an enzymatic reaction said enzymes include but are not limited to protein 
kinases, GTPases, ATPases, protein phosphatases, phospholipases and cyclic nucleotide 

20 phosphodiesterases. The cellular responses include but are not limited to gene 

transcription, secretion, proliferation, mechanical activity, metabolic activity, cell death. 

The term "second messenger" is used to indicate a low molecular weight component 
involved in the early events of intracellular signal transduction pathways. 

The term "luminophore" is used to indicate a chemical substance that has the property of 
25 emitting light either inherently or upon stimulation with chemical or physical means. This 
includes but is not limited to fluorescence, bioluminescence, phosphorescence, and 
chemiluminescence. 

The term "mechanically intact living cell" is used to indicate a cell which is considered 
living according to standard criteria for that particular type of cell such as maintenance of 
30 normal membrane potential, energy metabolism, proliferative capability, and has not 
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experienced any physically invasive treatment designed to introduce external substances 
into the cell such as microinjection. 

In the present context, the term "permeabilised living ceir is used to indicate cells where a 
pore forming agent such as Streptolysin O or Staphylococcus Aureus a-toxin has been 
5 applied and thereby incorporated into the plasma membrane in the cells. This creates 
proteinaceous pores with a defined pore size in the plasma membranes of the exposed 
cells. Pores could also be made by electroporation, i.e. exposing the cells to high voltage 
discharges, a procedure that creates small holes in the plasma membrane by coagulating 
integral membrane proteins. Treatment with a mild detergent such as saponin may 

10 accomplish the same thing. Common to all these treatments are that pores are formed only 
in the plasma membrane without affecting the integrity of cytoplasmic structural elements 
and organelles. The term living in this context means that the permeabilised cells bathed in 
a solution mimicking the intracellular milieu still have functional organelles, such as 
actively respiring mitochondria and endoplasmic reticulum that can take up and release 

15 calcium ions, and functional structural elements. The benefit of this method is that 

substances that normally can not traverse the plasma membrane, but most likely exert their 
influence intracellularly, can be introduced and their influence studied without 
cumbersome microinjection of the substances into single cells. Using this method the 
response to an influence can be recorded from many cells simultaneously. 

20 In the present context, the term "permeabilisation" is intended to indicate the selective 

disruption of the plasma membrane barrier so that soluble substances freely mobile in the 
cytosol are lost from the cells. The permeabilisation can be achieved as described above 
under "permeabilised living cells" or by using other chemical detergents such as Triton X- 
100 or digitonin in carefully titrated amounts. 

25 The term "physiologically relevant", when applied to an experimentally determined 

redistribution of an intracellular component, as measured by a change in the luminescence 
properties or distribution, is used to indicate that said redistribution can be explained in 
terms of the underlying biological phenomenon which gives rise to the redistribution. 

The terms "image processing" and "image analysis" are used to describe a large family of 
30 digital data analysis techniques or combination of such techniques which reduce ordered 
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arrays of numbers (images) to quantitative information describing those ordered arrays of 
numbers. When said ordered arrays of numbers represent measured values from a physical 
process, the quantitative information derived is therefore a measure of the physical 
process. 

5 The term "fluorescent probe" is used to indicate a fluorescent fusion polypeptide 

comprising a GFP or any functional part thereof which is N- or C-terminally fused to a 
biologically active polypeptide as defined herein, optionally via a peptide linker consisting 
of one or more amino acid residues, where the size of the linker peptide in itself is not 
critical as long as the desired functionality of the fluorescent probe is maintained. A 
10 fluorescent probe according to the invention is expressed in a cell and basically mimics the 
physiological behaviour of the biologically active polypeptide moiety of the fusion 
polypeptide. 

The term "mammalian cell" is intended to indicate any living cell of mammalian origin. 
The cell may be an established cell line, many of which are available from The American 

15 Type Culture Collection (ATCC, Virginia, USA) or a primary cell with a limited life span 
derived from a mammalian tissue, including tissues derived from a transgenic animal, or a 
newly established immortal cell line derived from a mammalian tissue including 
transgenic tissues, or a hybrid cell or cell line derived by fusing different cell types of 
mammalian origin e.g. hybridoma cell lines. The cells may optionally express one or more 

20 non-native gene products, e.g. receptors, enzymes, enzyme substrates, prior to or in 

addition to the fluorescent probe. Preferred cell lines include but are not limited to those of 
fibroblast origin, e.g. BHK, CHO, BALB, or of endothelial origin, e.g. HUVEC, BAE 
(bovine artery endothelial), CPAE (cow pulmonary artery endothelial), HLMVEC (human 
lung microvascular endothelial cells) or of pancreatic origin, e.g. RIN, INS-1, MIN6, 

25 bTC3, aTC6, bTC6, HIT, or of hematopoietic origin, e.g. primary isolated human 

monocytes, macrophages, neutrophils, basophils, eosinophils and lyphocyte populations, 
AML-193, HL-60, RBL-1, adipocyte origin, e.g. 3T3-L1, neuronal/neuroendocrine origin, 
e.g. AtT20, PC12, GH3, muscle origin, e.g. SKMC, A10, C2C12, renal origin, e.g. HEK 
293, LLC-PK1. 

30 The term "hybrid polypeptide" is intended to indicate a polypeptide which is a fusion of at 
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least a portion of each of two proteins, in this case at least a portion of the green 
fluorescent protein, and at least a portion of a catalytic and/or regulatory domain of a 
protein kinase. Furthermore a hybrid polypeptide is intended to indicate a fusion 
polypeptide comprising a GFP or at least a portion of the green fluorescent protein that 
contains a functional fluorophore, and at least a portion of a biologically active 
polypeptide as defined herein provided that said fusion is not the PKCa-GFP, PKCy-GFP, 
and PKCe-GFP disclosed by Schmidt et al. and Sakai et al., respectively. Thus, GFP may 
be N- or C-terminally tagged to a biologically active polypeptide, optionally via a linker 
portion or linker peptide consisting of a sequence of one or more amino acids. The hybrid 
polypeptide or fusion polypeptide may act as a fluorescent probe in intact living cells 
carrying a DNA sequence encoding the hybrid polypeptide under conditions permitting 
expression of said hybrid polypeptide. 

The term "kinase" is intended to indicate an enzyme that is capable of phosphorylating a 
cellular component. 

The term "protein kinase" is intended to indicate an enzyme that is capable of 
phosphorylating serine and/or threonine and/or tyrosine in peptides and/or proteins. 

The term "phosphatase" is intended to indicate an enzyme that is capable of 
dephosphorylating phosphoserine and/or phosphothreonine and/or phosphotyrosine in 
peptides and/or proteins. 

20 The term "cyclic nucleotide phosphodiesterase" is intended to indicate an enzyme that is 
capable of inactivating the second messengers cAMP and cGMP by hydrolysis of their 3'- 
ester bond. 

In the present context, the term "biologically active polypeptide" is intended to indicate a 
polypeptide affecting intracellular processes upon activation, such as an enzyme which is 
active in intracellular processes or a portion thereof comprising a desired amino acid 
sequence which has a biological function or exerts a biological effect in a cellular system. 
In the polypeptide one or several amino acids may have been deleted, inserted or replaced 
to alter its biological function, e.g. by rendering a catalytic site inactive. Preferably, the 
biologically active polypeptide is selected from the group consisting of proteins taking part 
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in an intracellular signalling pathway, such as enzymes involved in the intracellular 
phosphorylation and dephosphorylation processes including kinases, protein kinases and 
phosphorylases as defined herein, but also proteins making up the cytoskeleton play 
important roles in intracellular signal transduction and are therefore included in the 
5 meaning of "biologically active polypeptide" herein. More preferably, the biologically 
active polypeptide is a protein which according to its state as activated or non-activated 
changes localisation within the cell, preferably as an intermediary component in a signal 
transduction pathway. Included in this preferred group of biologically active polypeptides 
are cAMP dependent protein kinase A. 

10 The term "a substance having biological activity" is intended to indicate any sample that 
has a biological function or exerts a biological effect in a cellular system. The sample may 
be a sample of a biological material such as a sample of a body fluid including blood, 
plasma, saliva, milk, urine, or a microbial or plant extract, an environmental sample 
containing pollutants including heavy metals or toxins, or it may be a sample containing a 

15 compound or mixture of compounds prepared by organic synthesis or genetic techniques. 

The phrase "any change in fluorescence" means any change in absorption properties, such 
as wavelength and intensity, or any change in spectral properties of the emitted light, such 
as a change of wavelength, fluorescence lifetime, intensity or polarisation, or any change 
in the intracellular localisation of the fluorophore. It may thus be localised to a specific 
20 cellular component (e.g. organelle, membrane, cytoskeleton, molecular structure) or it may 
be evenly distributed throughout the cell or parts of the cell. 

The term "organism" as used herein indicates any unicellular or multicellular organism 
preferably originating from the animal kingdom including protozoans, but also organisms 
that are members of the plant kingdoms, such as algae, fungi, bryophytes, and vascular 
25 plants are included in this definition. 

The term "nucleic acid" is intended to indicate any type of poly- or oligonucleic acid 
sequence, such as a DNA sequence, a cDNA sequence, or an RNA sequence. 

The term "biologically equivalent" as it relates to proteins is intended to mean that a first 
protein is equivalent to a second protein if the cellular functions of the two proteins may 
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substitute for each other, e.g. if the two proteins are closely related isoforms encoded by 
different genes, if they are splicing variants, or allelic variants derived from the same gene, 
if they perform identical cellular functions in different cell types, or in different species. 
The term "biologically equivalent" as it relates to DNA is intended to mean that a first 
5 DNA sequence encoding a polypeptide is equivalent to a second DNA sequence encoding 
a polypeptide if the functional proteins encoded by the two genes are biologically 
equivalent. 

The phrase "back-tracking of a signal transduction pathway" is intended to indicate a 
process for defining more precisely at what level a signal transduction pathway is affected, 

10 either by the influence of chemical compounds or a disease state in an organism. Consider 
a specific signal transduction pathway represented by the bioactive polypeptides A - B - C 
- D, with signal transduction from A towards D. When investigating all components of this 
signal transduction pathway compounds or disease states that influence the activity or 
redistribution of only D can be considered to act on C or downstream of C whereas 

15 compounds or disease states that influence the activity or redistribution of C and D, but not 
of A and B can be considered to act downstream of B. 

The term "fixed cells" is used to mean cells treated with a cytological fixative such as 
glutaraldehyde or formaldehyde, treatments that serve to chemically cross-link and 
stabilise soluble and insoluble proteins within the structure of the cell. Once in this state, 
20 such proteins cannot be lost from the structure of the now-dead cell. 

In the present context a "screening assay" is intended to mean any measurement protocol, 
including materials, cells, instruments, chemicals, reagents, detection units, calibration and 
quantification procedures used to measure a response from mechanically intact or 
permeabilised living cells relevant to influences on an intracellular pathway. 

25 The term "dose-response relationship" and "screening programme" is in the present 

context intended to mean a clear correlation between the quantified response of cells in a 
screening assay to application of an influence, such as a compound, and the concentration 
of the applied influence. The response to the influence may be both an up-regulation and a 
down-regulation of the quantified parameter used in the screening assay. 
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In the present context, the term "physiology" is intended to mean the normal function of 
biological and biochemical processes inside cells, between cells and in the whole organism 
or animal. 

5 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1. CHO cells expressing the PKAc-F64L-S65T-GFP hybrid protein have been 
treated in HAM's F12 medium with 50 jaM forskolin at 37°C. The images of the GFP 
fluorescence in these cells have been taken at different time intervals after treatment, 
which were: a) 40 seconds b) 60 seconds c) 70 seconds d) 80 seconds. The fluorescence 
10 changes from a punctate to a more even distribution within the (non-nuclear) cytoplasm. 

Figure 2. Time-lapse analysis of forskolin induced PKAc-F64L-S65T-GFP redistribution. 
CHO cells, expressing the PKAc-F64L-S65T-GFP fusion protein were analysed by time- 
lapse fluorescence microscopy. Fluorescence micrographs were acquired at regular 
15 intervals from 2 min before to 8 min after the addition of agonist. The cells were 
challenged with 1 |aM forskolin immediately after the upper left image was acquired (t=0). 
Frames were collected at the following times: i) 0, ii) 1, iii) 2, iv) 3, v) 4 and vi) 5 minutes. 
Scale bar 10 (am. 

20 Figure 3. Time-lapse analyses of PKAc-F64L-S65T-GFP redistribution in response to 
various agonists. The effects of 1 jaM forskolin (A), 50 |iM forskolin (B), ImM dbcAMP 
(C) and 100 |iM IBMX (D) (additions indicated by open arrows) on the localisation of the 
PKAc-F64L-S65T-GFP fusion protein were analysed by time-lapse fluorescence 
microscopy of CHO/PKAc-F64L-S65T-GFP cells. The effect of addition of 10 \iM 
25 forskolin (open arrow), followed shortly by repeated washing with buffer (solid arrow), on 
the localisation of the PKAc-F64L-S65T-GFP fusion protein was analysed in the same 
cells (E). In a parallel experiment, the effect of adding 10 \aM forskolin and 100 (iM 
IBMX (open arrow) followed by repeated washing with buffer containing 100 |iM IBMX 
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(solid arrow) was analysed (F). Removing forskolin caused PKAc-F64L-S65T-GFP fusion 
protein to return to the cytoplasmic aggregates while this is prevented by the continued 
presence of IBMX (F). The effect of 100 nM glucagon (Fig 3G, open arrow) on the 
localisation of the PKAc-F64L-S65T-GFP fusion protein is also shown for BHK/GR, 
5 PKAc-F64L-S65T-GFP cells. The effect of 10 j.iM norepinephrine (H), solid arrow, on the 
localisation of the PKAc-F64L-S65T-GFP fusion protein was analysed similarly, in 
transiently transfected CHO, PKAc-F64L-S65T-GFP cells, pretreated with 10 |aM 
forskolin, open arrow, to increase [cAMP]. N.B. in Fig 3H the x-axis counts the image 
numbers, with 12 seconds between images. The raw data of each experiment consisted of 

10 60 fluorescence micrographs acquired at regular intervals including several images 
acquired before the addition of buffer or agonist. The charts (A-G) each show a 
quantification of the response seen through all the 60 images, performed as described in 
analysis method 2. The change in total area of the highly fluorescent aggregates, relative to 
the initial area of fluorescent aggregates is plotted as the ordinate in all graphs in Figure 3, 

15 versus time for each experiment. Scale bar 10 (am. 

Figure 4. Dose-response curve (two experiments) for forskolin-induced redistribution of 
the PKAc-F64L-S65T-GFP fusion. 

20 Figure 5. Time from initiation of a response to half maximal (ti/ 2ma x) and maximal (t max ) 
PKAc-F64L-S65T-GFP redistribution. The data was extracted from curves such as that 
shown in "Figure 2." All ti/ 2m ax and t max values are given as mean±SD and are based on a 
total of 26-30 cells from 2-3 independent experiments for each forskolin concentration. 
Since the observed redistribution is sustained over time, the t max values were taken as the 

25 earliest time point at which complete redistribution is reached. Note that the values do not 
relate to the degree of redistribution. 

Figure 6. Parallel dose-response analyses of forskolin induced cAMP elevation and PKAc- 
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F64L-S65T-GFP redistribution. The effects of buffer or 5 increasing concentrations of 
forskolin on the localisation of the PKAc-F64L-S65T-GFP fusion protein in CHO/PKAc- 
F64L-S65T-GFP cells, grown in a 96 well plate, were analysed as described above. 
Computing the ratio of the SD's of fluorescence micrographs taken of the same field of 

5 cells, prior to and 30 min after the addition of forskolin, gave a reproducible measure of 
PKAc-F64L-S65T-GFP redistribution. The graph shows the individual 48 measurements 
and a trace of their mean±s.e.m at each forskolin concentration. For comparison, the 
effects of buffer or 8 increasing concentrations of forskolin on [cAMP]; was analysed by a 
scintillation proximity assay of cells grown under the same conditions. The graph shows a 

10 trace of the mean ± s.e.m of 4 experiments expressed in arbitrary units. 

Figure 7. BHK cells stably transfected with the human muscarinic (hMl) receptor and the 
PKCa-F64L-S65T-GFP fusion. Carbachol (100 \xM added at 1.0 second) induced a 
transient redistribution of PKCot-F64L-S65T-GFP from the cytoplasm to the plasma 
15 membrane. Images were taken at the following times: a) 1 second before carbachol 
addition, b) 8.8 seconds after addition and c) 52.8 seconds after addition. 

Figure 8. BHK cells stably transfected with the hMl receptor and PKCa-F64L-S65T-GFP 
fusion were treated with carbachol (1 |aM, 10 jaM, 100 |aM). In single cells intracellular 

20 [Ca 2+ ] was monitored simultaneously with the redistribution of PKCa-F64L-S65T-GFP. 
Dashed line indicates the addition times of carbachol. The top panel shows changes in the 
intracellular Ca 2+ concentration of individual cells with time for each treatment. The 
middle panel shows changes in the average cytoplasmic GFP fluorescence for individual 
cells against time for each treatment. The bottom panel shows changes in the fluorescence 

25 of the periphery of single cells, within regions that specifically include the circumferential 
edge of a cell as seen in normal projection, the best regions for monitoring changes in the 
fluorescence intensity of the plasma membrane. 
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Figure 9. 

a) The hERKl-F64L-S65T-GFP fusion expressed in HEK293 cells treated with 100 
of the MEK1 inhibitor PD98059 in HAM F-12 (without serum) for 30 minutes at 37 
°C. The nuclei empty of fluorescence during this treatment. 

5 b) The same cells as in (a) following treatment with 10 % foetal calf serum for 15 minutes 
at 37 °C. 

c) Time profiles for the redistribution of GFP fluorescence in HEK293 cells following 
treatment with various concentrations of EGF in Hepes buffer (HAM F-12 replaced 
with Hepes buffer directly before the experiment). Redistribution of fluorescence is 

10 expressed as the change in the ratio value between areas in nucleus and cytoplasm of 

single cells. Each time profile is the mean for the changes seen in six single cells. 

d) Bar chart for the end-point measurements, 600 seconds after start of EGF treatments, 
of fluorescence change (nucleusxytoplasm) following various concentrations of EGF. 



15 Figure 10. 

a) The SMAD2-EGFP fusion expressed in HEK293 cells starved of serum overnight in 
HAM F-12. HAM F-12 was then replaced with Hepes buffer pH 7.2 immediately before 
the experiment. Scale bar is 10 jam. 

b) HEK 293 cells expressing the SMAD2-EGFP fusion were treated with various 
20 concentration of TGF-beta as indicated, and the redistribution of fluorescence 

monitored against time. The time profile plots represent increases in fluorescence 
within the nucleus, normalised to starting values in each cell measured. Each trace is the 
time profile for a single cell nucleus. 

c) A bar chart representing the end-point change in fluorescence within nuclei (after 850 
25 seconds of treatment) for different concentrations of TGF-beta. Each bar is the value for 

a single nucleus in each treatment. 
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Figure 1 1 . The V ASP-F64L-S65T-GFP fusion in CHO cells stably transfected with the 
human insulin receptor. The cells were starved for two hours in HAM F-12 without serum, 
then treated with 10% foetal calf serum. The image shows the resulting redistribution of 
5 fluorescence after 15 minutes of treatment. GFP fluorescence becomes localised in 
structures identified as focal adhesions along the length of actin stress fibres. 

Figure 12. Time lapse recording GLUT4-GFP redistribution in CHO-HIR cells. Time 
indicates minutes after the addition of 100 nM insulin. 

10 

Figure 13. Dose-response relationships for the influence of insulin on the disappearance of 
total fluorescence from the centrally located area of GLUT4-GFP. Data points indicate 
mean±SE. 

15 Figure 14. Dose-response relationship for the translocation of PKCa-GFP in BHKhMl 
cells stimulated with the muscarininc agonist carbamylcholine using a FLIPR™ to do the 
actual experiments. 

Figure 15. Dose-response relationship for the translocation of PKAc-GFP in CHO/PKAc- 
20 F64L-S65T-GFP cells stimulated with forskolin using a FLIPR™ to do the actual 
experiments. 

Figure 16. Dose-response relationship for the disappearance of fluorescence from 
permeabilised CHO/PKAc-F64L-S65T-GFP when previously exposed to different doses of 
25 forskolin. 
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EXAMPLES 
EXAMPLE 1 

Construction, testing and implementation of an assay for cAMP based on PKA 
5 activation in real time within living cells. 

Useful for monitoring the activity of signalling pathways that lead to altered 
concentrations of cAMP, e.g. activation of G-protein coupled receptors which couple to G- 
proteins of the G s or G, class. 

The catalytic subunit of the murine cAMP dependent protein kinase (PKAc) was fused C- 
10 terminally to a F64L-S65T derivative of GFP. The resulting fusion (PKAc-F64L-S65T- 
GFP) was used for monitoring in vivo the translocation and thereby the activation of PKA. 

To construct the PKAc-F64L-S65T-GFP fusion, convenient restriction endonuclease sites 
were introduced into the cDNAs encoding murine PKAc (Gen Bank Accession number: 
M12303) and F64L-S65T-GFP (sequence disclosed in WO 97/11094) by polymerase chain 
15 reaction (PCR). The PCR reactions were performed according to standard protocols with 
the following primers: 

5'PKAc: 

TTggACACAAgCTTTggACACCCTCAggATATgggCAACgCCgCCgCCgCCAAg (SEQ 
IDNO:3), 

20 3TKAc: 

g TCATCTTCTCgAgTCTTTCAggCgCgCCCAAACTCAgTAAACTCCTTgCCACAC 
(SEQ ID NO:4) , 

5' GFP: TTggACACAAgCTTTggACACggCgCgCCATgAgTAAAggAgAAgAACTTTTC 
(SEQ ID NO:l), 

25 3'GFP: g TCATCTTCTCgAgTCTTACTCCTgAggTTTgTATAgTTCATCCATgCCATgT 
(SEQ IDNO:2). 
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The PKAc amplification product was then digested with Hindlll+AscI and the F64L- 
S65T-GFP product with Ascl+Xhol. The two digested PCR products were subsequently 
ligated with a Hindlll+Xhol digested plasmid (pZeoSV® mammalian expression vector, 
Invitrogen, San Diego, CA, USA). The resulting fusion construct (SEQ ID NO:68 & 69) 
5 was under control of the SV40 promoter. 

Transfection and cell culture conditions: 

Chinese hamster ovary cells (CHO), were transfected with the plasmid containing the 
PKAc-F64L-S65T-GFP fusion using the calcium phosphate precipitate method in HEPES- 

10 buffered saline (Sambrook et al. y 1989). Stable transfectants were selected using 1000 jag 
Zeocin/ml (Invitrogen) in the growth medium (DMEM with 1000 mg glucose/1, 10 % fetal 
bovine serum (FBS), 100 jag penicillin-streptomycin mixture ml" 1 , 2 mM L-glutamine 
purchased from Life Technologies Inc., Gaithersburg, MD, USA). Untransfected CHO 
cells were used as the control. To assess the effect of glucagon on fusion protein 

15 translocation, the PKAc-F64L-S65T-GFP fusion was stably expressed in baby hamster 
kidney cells overexpressing the human glucagon receptor (BHK/GR cells). Untransfected 
BHK/GR cells were used as the control. Expression of GR was maintained with 500 |ag 
G418/ml (Neo marker) andPKAc-F64L-S65T-GFP was maintained with 500 \ig Zeocin/ml 
(Sh ble marker). CHO cells were also simultaneously co-transfected with vectors 

20 containing the PKAc-F64L-S65T-GFP fusion and the human a2a adrenoceptor (hARa2a). 

For fluorescence microscopy, cells were allowed to adhere to Lab-Tek chambered 
coverglasses (Nalge Nunc Int., Naperville, IL, USA) for at least 24 hours and cultured to 
about 80% confluence. Prior to experiments, the cells were cultured over night without 
selection pressure in HAM F-12 medium with glutamax (Life Technologies), 100 |ig 
25 penicillin-streptomycin mixture ml" 1 and 0.3 % FBS. This medium has low 
autofluorescence enabling fluorescence microscopy of cells straight from the incubator. 

Monitoring activity of PKA activity in real time: 
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Image aquisition of live cells were gathered using a Zeiss Axiovert 135M fluorescence 
microscope fitted with a Fluar 40X, NA: 1.3 oil immersion objective and coupled to a 
Photometries CH250 charged coupled device (CCD) camera. The cells were illuminated 
with a 100 W HBO arc lamp. In the light path was a 470±20 nm excitation filter, a 510 nm 
5 dichroic mirror and a 515±15 nm emission filter for minimal image background. The cells 
were maintained at 37°C with a custom built stage heater. 

Images were processed and analysed in the following manner: 

Method 1 : Stepwise procedure for quantitation of translocation of PKA: 

1. The image was corrected for dark current by performing a pixel-by-pixel subtraction of 
10 a dark image (an image taken under the same conditions as the actual image, except the 

camera shutter is not allowed to open). 

2. The image was corrected for non-uniformity of the illumination by performing a pixel- 
by-pixel ratio with a flat field correction image (an image taken under the same 
conditions as the actual image of a uniformly fluorescent specimen). 

15 3. The image histogram, i.e., the frequency of occurrence of each intensity value in the 
image, was calculated. 

4. A smoothed, second derivative of the histogram was calculated and the second zero is 
determined. This zero corresponds to the inflection point of the histogram on the high 
side of the main peak representing the bulk of the image pixel values. 

20 5. The value determined in step 4 was subtracted from the image. All negative values 
were discarded. 

6. The variance (square of the standard deviation) of the remaining pixel values was 
determined. This value represents the "response" for that image. 

7. Scintillation proximity assay (SPA) for independent quantitation of cAMP. 



25 



Method 2: Alternative method for quantitation of PKA redistribution: 
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1. The fluorescent aggregates are segmented from each image using an automatically 
found threshold based on the maximisation of the information measure between the 
object and background- The a priori entropy of the image histogram is used as the 
information measure. 

2. The area of each image occupied by the aggregates is calculated by counting pixels in 
the segmented areas. 

3. The value obtained in step 2 for each image in a series, or treatment pair, is normalised 
to the value found for the first (unstimulated) image collected. A value of zero (0) 
indicates no redistribution of fluorescence from the starting condition. A value of one 
(1) by this method equals full redistribution. 

Cells were cultured in HAM F-12 medium as described above, but in 96-well plates. The 
medium was exchanged with Ca 2+ -HEPES buffer including 100 (iM IBMX and the cells 
were stimulated with different concentrations of forskolin for 10 min. Reactions were 
stopped with addition of NaOH to 0.14 M and the amount of cAMP produced was 
measured with the cAMP-SPA kit, RPA538 (Amersham) as described by the 
manufacturer. 

Manipulating intracellular levels of cAMP to test the PKAc-F64L-S65T-GFP fusion. 

The following compounds were used to vary cAMP levels: Forskolin, an activator of 
adenylate cyclase; dbcAMP, a membrane permeable cAMP analog which is not degraded 
by phosphodiesterase; IBMX, an inhibitor of phosphodiesterase. 

CHO cells stably expressing the PKAc-F64L-S65T-GFP, showed a dramatic translocation 
of the fusion protein from a punctate distribution to an even distribution throughout the 
cytoplasm following stimulation with 1 jaM forskolin (n=3), 10 |uM forskolin (n-4) and 
50 |aM forskolin (n=4) (Fig 1), or dbcAMP at ImM (n=6). 

Fig. 2 shows the progression of response in time following treatment with 1 jaM forskolin. 
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Fig. 3 gives a comparison of the average temporal profiles of fusion protein redistribution 
and a measure of the extent of each response to the three forskolin concentrations (Fig. 3A, 
E, B), and to 1 mM dbcAMP (fig 3C) which caused a similar but slower response, and to 
addition of 100 |aM IBMX (n=4, Fig. 3D) which also caused a slow response, even in the 
5 absence of adenylate cyclase stimulation. Addition of buffer (n=2) had no effect (data not 
shown). 

As a control for the behaviour of the fusion protein, F64L-S65T-GFP alone was expressed 
in CHO cells and these were also given 50 jaM forskolin (n=5); the uniform diffuse 
distribution characteristic of GFP in these cells was unaffected by such treatment (data not 
10 shown). 

The forskolin-induced translocation of PKAc-F64L-S65T-GFP showed a dose-response 
relationship (Fig 4 and 6), see quantitative procedures above. 

Reversibility of PKAc-F64L-S65T-GFP translocation. 

15 The release of the PKAc probe from its cytoplasmic anchoring hotspots was reversible. 
Washing the cells repeatedly (5-8 times) with buffer after 10(iM forskolin treatment 
completely restored the punctate pattern within 2-5 min (n=2, Fig. 3E). In fact the fusion 
protein returned to a pattern of fluorescent cytoplasmic aggregates virtually 
indistinguishable from that observed before forskolin stimulation. 

20 To test whether the return of fusion protein to the cytoplasmic aggregates reflected a 
decreased [cAMP] i? cells were treated with a combination of 10 |aM forskolin and 100 jiM 
IBMX (n=2) then washed repeatedly (5-8 times) with buffer containing 100 |aM IBMX 
(Fig. 3F). In these experiments, the fusion protein did not return to its prestimulatory 
localisation after removal of forskolin. 

25 

Testing the PKA-F64L-S65T-GFP probe with physiologically relevant agents. 
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To test the probe's response to receptor activation of adenylate cyclase, BHK cells stably 
transfected with the glucagon receptor and the PKA-F64L-S65T-GFP probe were exposed 
to glucagon stimulation. The glucagon receptor is coupled to a G s protein which activates 
adenylate cyclase, thereby increasing the cAMP level. In these cells, addition of 100 nM 
5 glucagon (n=2) caused the release of the PKA-F64L-S65T-GFP probe from the 

cytoplasmic aggregates and a resulting translocation of the fusion protein to a more even 
cytoplasmic distribution within 2-3 min (Fig. 3G). Similar but less pronounced effects 
were seen at lower glucagon concentrations (n=2, data not shown). Addition of buffer 
(n=2) had no effect over time (data not shown). 

10 Transiently transfected CHO cells expressing hARa2a and the PKA-F64L-S65T-GFP 
probe were treated with 10 jaM forskolin for 7.5 minutes, then, in the continued presence 
of forskolin, exposed to 10 jaM norepinephrine to stimulate the exogenous 
adrenoreceptors, which couple to a G, protein, which inhibit adenylate cyclase. This 
treatment led to reappearance of fluorescence in the cytoplasmic aggregates indicative of a 

15 decrease in [cAMP] t (Fig. 3H). 

Fusion protein translocation correlated with [cAMP], 

As described above, the time it took for a response to come to completion was dependent 
on the forskolin dose (Fig. 5) In addition the degree of responses was also dose-dependent. 

20 To test the PKA-F64L-S65T-GFP fusion protein translocation in a semi high through-put 
system, CHO cells stably transfected with the PKA-F64L-S65T-GFP fusion was 
stimulated with buffer and 5 increasing doses of forskolin (n=8). Using the image analysis 
algorithm described above (Method 1), a dose-response relationship was observed in the 
range from 0.01-50 \jlM forskolin (Fig. 6). A half-maximal stimulation was observed at 

25 about 2 pM forskolin. In parallel, cells were stimulated with buffer and 8 increasing 
concentrations of forskolin (n=4) in the range 0.01-50 pM. The amount of cAMP produced 
was measured in an SPA assay. A steep increase was observed between 1 and 5 \jlM 
forskolin coincident with the steepest part of the curve for fusion protein translocation 
(also Fig. 6). 
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EXAMPLE 2 

Quantitation of redistribution in real-time within living cells. 

Probe for detection of PKC activity in real time within living cells: 
5 Construction of PKC-GFP fusion: 

The probe was constructed by ligating two restriction enzyme treated polymerase chain 
reaction (PCR) amplification products of the cDNA for murine PKCa (GenBank 
Accession number: M2581 1) and F64L-S65T-GFP (sequence disclosed in WO 97/1 1094) 
respectively. Taq® polymerase and the following oligonucleotide primers were used for 
10 PCR; 

5'mPKCa: 

TTggACACAAgCTTTggACACCCTCAggATATggCTgACgTTTACCCggCCAACg 
(SEQ ID NO:5), 

3'mPKCa: 

15 g TCATCTTCTCgAgTCTTTCAggCgCgCCCTACTgCACTTTgCAAgATTgggTgC (SEQ 
ID NO:6), 

5T64L-S65T-GFP: 

TTggACACAAgCTTTggACACggCgCgCCATgAgTAAAggAgAAgAACTTTTC (SEQ 
IDNO:l), 

20 3'F64L-S65T-GFP: 

g TCATCTTCTCgAgTCTTACTCCTgAggTTTgTATAgTTCATCCATgCCATgT (SEQ 

IDNO:2). 

The hybrid DNA strand was inserted into the pZeoSV® mammalian expression vector as a 
Hindlll-Xhol casette as described in example 1. 

25 BHK cells expressing the human Ml receptor under the control of the inducible 

metallothionine promoter and maintained with the dihydrofolate reductase marker were 
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transfected with the PKCa-F64L-S65T-GFP probe using the calcium phosphate precipitate 
method in HEPES buffered saline (HBS [pH 7.10]). Stable transfectants were selected 
using 1000 |Jg Zeocin®/ml in the growth medium (DMEM with 1000 mg glucose/1, 10 % 
foetal bovine serum (FBS), 100 |ig penicillin-streptomycin mixture ml-1, 2 mM 1- 
5 glutamine). The hMl receptor and PKCa-F64L-S65T-GFP fusion protein were maintained 
with 500 nM methotrexate and 500 (ig Zeocin®/ml respectively. 24 hours prior to any 
experiment, the cells were transferred to HAM F-12 medium with glutamax, 100 pg 
penicillin-streptomycin mixture ml 1 and 0.3 % FBS. This medium relieves selection 
pressure, gives a low induction of signal transduction pathways and has a low 
10 autofluorescence at the relevant wavelength enabling fluorescence microscopy of cells 
straight from the incubator. 

Method 1: Monitoring the PKCa activity in real time: 

Digital images of live cells were gathered using a Zeiss Axiovert 135M fluorescence 
15 microscope fitted with a 40X, NA: 1.3 oil immersion objective and coupled to a 

Photometries CH250 charged coupled device (CCD) camera. The cells were illuminated 
with a 100 W arc lamp. In the light path was a 470±20 nm excitation filter, a 510 nm 
dichroic mirror and a 515±15 nm emission filter for minimal image background. The cells 
were kept and monitored to be at 37°C with a custom built stage heater. 

20 Images were analyzed using the IPLab software package for Macintosh. 

Upon stimulation of the Ml-BHK cells, stably expressing the PKCa-F64L-S65T-GFP 
fusion, with carbachol we observed a dose-dependent transient translocation from the 
cytoplasm to the plasma membrane (Fig. 7a,b,c). Simultaneous measurement of the 
cytosolic free calcium concentration shows that the carbachol-induced calcium 
25 mobilisation precedes the translocation (Fig. 8). 

Stepwise procedure for quantitation of translocation of PKCa: 
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1. The image was corrected for dark current by performing a pixel-by-pixel subtraction 
of a dark image (an image taken under the same conditions as the actual image, except 
the camera shutter is not allowed to open). 

2. The image was corrected for non-uniformity of the illumination by performing a 
pixel-by-pixel ratio with a flat field correction image (an image taken under the same 
conditions as the actual image of a uniformly fluorescent specimen). 

3. A copy of the image was made in which the edges are identified. The edges in the 
image are found by a standard edge-detection procedure - convolving the image with 
a kernel which removes any large-scale unchanging components (i.e., background) 
and accentuates any small-scale changes (i.e., sharp edges). This image was then 
converted to a binary image by threshholding. Objects in the binary image which are 
too small to represent the edges of cells were discarded. A dilation of the binary image 
was performed to close any gaps in the image edges. Any edge objects in the image 
which were in contact with the borders of the image are discarded. This binary image 
represents the edge mask. 

4. Another copy of image was made via the procedure in step 3. This copy was further 
processed to detect objects which enclose "holes" and setting all pixels inside the 
holes to the binary value of the edge, i.e., one. This image represents the whole cell 
mask. 

5. The original image was masked with the edge mask from step 3 and the sum total of 
all pixel values is determined. 

6. The original image was masked with the whole cell mask from step 4 and the sum 
total of all pixel values was determined. 

7. The value from step 5 was divided by the value from step 6 to give the final result, the 
fraction of fluorescence intensity in the cells which was localized in the edges. 



EXAMPLE 3 
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Probes for detection of mitogen activated protein kinase Erkl redistribution. 

Useful for monitoring signalling pathways involving MAPK, e.g. to identify compounds 
which modulate the activity of the pathway in living cells. 

Erkl, a serine/threonine protein kinase, is a component of a signalling pathway that is 
5 activated by e.g. many growth factors. 

Probes for detection of ERK-1 activity in real time within living cells: 

The extracellular signal regulated kinase (ERK-1, a mitogen activated protein kinase, 
MAPK) is fused N- or C-terminally to a derivative of GFP. The resulting fusions 
expressed in different mammalian cells are used for monitoring in vivo the nuclear 
10 translocation, and thereby the activation, of ERK1 in response to stimuli that activate the 
MAPK pathway. 

a) Construction of murine ERK1 - F64L-S65T-GFP fusion: 

Convenient restriction endonuclease sites are introduced into the cDNAs encoding 
murine ERK1 (GenBank Accession number: Z14249) and F64L-S65T-GFP (sequence 
15 disclosed in WO 97/1 1094) by polymerase chain reaction (PGR). The PCR reactions are 
performed according to standard protocols with the following primers: 

5'ERKl: 

TTggACACAAgCTTTggACACCCTCAggATATggCggCggCggCggCggCTCCgggggg 
Cgggg (SEQ ID NO:7), 

20 3'ERKl: 

gTCATCTTCTCgAgTCTTTCAggCgCgCCCggggCCCTCTggCgCCCCTggCTgg 
(SEQ ID NO:8), 

5T64L-S65T-GFP: 

TTggACACAAgCTTTggACACggCgCgCCATgAgTAAAggAgAAgAACTTTTC 
25 (SEQ ID NO: 1) 
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3T64L-S65T-GFP: 

gTCATCTTCTCgAgTCTTACTCCTgAggTTTgTATAgTTCATCCATgCCATgT (SEQ 
ID NO:2) 

To generate the mERKl-F64L-S65T-GFP (SEQ ID NO:56 & 57) fusion the ERK1 
5 amplification product is digested with Hindlll+AscI and the F64L-S65T-GFP product 
with Ascl+Xhol. To generate the F64L-S65T-GFP-mERKl fusion the ERK1 
amplification product is then digested with HindIII+Bsu36I and the F64L-S65T-GFP 
product with Bsu36I+XhoI.The two pairs of digested PCR products are subsequently 
ligated with a Hindlll+Xhol digested plasmid (pZeoSV® mammalian expression 
10 vector, Invitrogen, San Diego, CA, USA). The resulting fusion constructs are under 
control of the SV40 promoter. 

b) The human Erkl gene (GenBank Accession number: X60188) was amplified using 
PCR according to standard protocols with primers Erkl -top (SEQ ID NO:9) and Erkl- 
bottomAKstop (SEQ ID NO: 10) . The PCR product was digested with restriction 
15 enzymes EcoRl and BamHl, and ligated into pEGFP-Cl (Clontech, Palo Alto; 

GenBank Accession number U55763) digested with EcoRl and BamHl. This produces 
an EGFP-Erkl fusion (SEQ ID NO:38 &39) under the control of a CMV promoter. 

The plamid containing the EGFP-Erkl fusion was transfected into HEK293 cells 
employing the FUGENE transfection reagent (Boehringer Mannheim). Prior to 

20 experiments the cells were grown to 80%-90% confluency 8 well chambers in DMEM 
with 10% FCS. The cells were washed in plain HAM F-12 medium (without FCS), and 
then incubated for 30-60 minutes in plain HAM F-12 (without FCS) with 100 micromolar 
PD98059, an inhibitor of MEK1, a kinase which activates Erkl; this step effectively 
empties the nucleus of EGFP-Erkl. Just before starting the experiment, the HAM F-12 was 

25 replaced with Hepes buffer following a wash with Hepes buffer. This removes the 

PD98059 inhibitor; if blocking of MEK1 is still wanted (e.g. in control experiments), the 
inhibitor is included in the Hepes buffer. 

The experimental setup of the microscope was as described in example 1. 



22132DK1 Appendix A 



22129DK1 



53 



60 images were collected with 10 seconds between each, and with the test compound 
added after image number 10. 

Addition of EGF (1-100 nM) caused within minutes a redistribution of EGFP-Erkl from 
the cytoplasm into the nucleus (Fig. 9a, b). 

5 The response was quantitated as described below and a dose-dependent relationship 
between EGF concentration and nuclear translocation of EGFP-Erkl was found (Fig. 
9c,d). Redistribution of GFP fluorescence is expressed in this example as the change in the 
ratio value between areas in nuclear versus cytoplasmic compartments of the cell. Each 
time profile is the average of nuclear to cytoplasmic ratios from six cells in each treatment. 

10 

EXAMPLE 4 

Probes for detection of Erk2 redistribution. 

Useful for monitoring signalling pathways involving MAPK, e.g. to identify compounds 
which modulate the activity of the pathway in living cells. 

15 Erk2, a serine/threonine protein kinase, is closely related to Erkl but not identical; it is a 
component of a signalling pathway that is activated by e.g. many growth factors. 

a) The rat Erk2 gene (GenBank Accession number: M64300) was amplified using PCR 
according to standard protocols with primers Erk2-top (SEQ ID NO:l 1) and Erk2- 
bottomAfstop (SEQ ID NO: 13) The PCR product was digested with restriction enzymes 

20 Xhol and BamHl, and ligated into pEGFP-Cl (Clontech, Palo Alto; GenBank 

Accession number U55763) digested with Xhol and BamHl. This produces an EGFP- 
Erk2 fusion (SEQ ID NO:40 &41) under the control of a CMV promoter. 

b) The rat Erk2 gene (GenBank Accession number: M64300) was amplified using PCR 
according to standard protocols with primers (SEQ ID NO:l 1) Erk2-top and Erk2- 

25 bottomAstop (SEQ ID NO: 12). The PCR product was digested with restriction enzymes 

Xhol and BamHl, and ligated into pEGFP-Nl (Clontech, Palo Alto; GenBank 



22132DK1 Appendix A 



22129DK1 



54 



Accession number U55762) digested with Xhol and BamHl. This produces an Erk2- 
EGFP fusion (SEQ ID NO:58 &59) under the control of a CMV promoter. 

The resulting plasmids were transfected into CHO cells and BHK cells. The cells were 
grown under standard conditions. Prior to experiments, the cells were starved in medium 
without serum for 48-72 hours. This led to a predominantly cytoplasmic localisation of 
both probes, especially in BHK cells. 10% fetal calf serum was added to the cells and the 
fluorescence of the cells was recorded as explained in example 3. Addition of serum 
caused the probes to redistribute into the nucleus within minutes of addition of serum. 

EXAMPLE 5 

Probes for detection of Smad2 redistribution. 

Useful for monitoring signalling pathways activated by some members of the transforming 
growth factor-beta family, e.g. to identify compounds which modulate the activity of the 
pathway in living cells. 

Smad 2, a signal transducer, is a component of a signalling pathway that is induced by 
some members of the TGFbeta family of cytokines. 

a) The human Smad2 gene (GenBank Accession number: AF027964) was amplified using 
PCR according to standard protocols with primers Smad2-top (SEQ ID NO:24) and 
Smad2-bottom/+stop (SEQ ID NO:26) . The PCR product was digested with restriction 
enzymes EcoRl and Acc65I, and ligated into pEGFP-Cl (Clontech; Palo Alto; 
GenBank Accession number U55763) digested with EcoRl and Acc65I. This produces 
an EGFP-Smad2 fusion (SEQ ID NO:50&51) under the control of a CMV promoter. 

b) The human Smad2 gene (GenBank Accession number: AF027964) was amplified using 
PCR according to standard protocols with primers Smad2-top (SEQ ID NO:24) and 
Smad2-bottom/-stop (SEQ ID NO:25) . The PCR product was digested with restriction 
enzymes EcoRl and Acc65I, and ligated into pEGFP-Nl (Clontech, Palo Alto; 
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GenBank Accession number U55762) digested with EcoRl and Acc65I. This produces 
a Smad2-EGFP fusion (SEQ ID NO:74 &75) under the control of a CMV promoter. 

The plasmid containing the EGFP-Smad2 fusion was transfected into HEK293 cells, 
where it showed a cytoplasmic distribution. Prior to experiments the cells were grown in 8 
5 well Nunc chambers in DMEM with 10% FCS to 80% confluence and starved overnight in 
HAM F-12 medium without FCS. 

For experiments, the HAM F-12 medium was replaced with Hepes buffer pH 7.2. 

The experimental setup of the microscope was as described in example 1. 

90 images were collected with 10 seconds between each, and with the test compound 
10 added after image number 5. 

After serum starvation of cells, each nucleus contains less GFP fluorescence than the 
surrounding cytoplasm (Fig. 10a). Addition of TGFbeta caused within minutes a 
redistribution of EGFP-Smad2 from the cytoplasma into the nucleus (Fig. 10b). 

The redistribution of fluorescence within the treated cells was quantified simply as the 
15 fractional increase in nuclear fluorescence normalised to the starting value of GFP 
fluorescence in the nucleus of each unstimulated cell. 

EXAMPLE 6 

Probe for detection of VASP redistribution. 

20 Useful for monitoring signalling pathways involving rearrangement of cytoskeletal 

elements, e.g. to identify compounds which modulate the activity of the pathway in living 
cells. 

VASP, a phosphoprotein, is a component of cytoskeletal structures, which redistributes in 
response to signals that affect focal adhesions. 
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The human VASP gene (GenBank Accession number: Z46389) was amplified using PCR 
according to standard protocols with primers VASP-top (SEQ ID NO:94) and VASP- 
bottomZ+stop (SEQ ID NO:95). The PCR product was digested with restriction enzymes 
Hind3 and BamHl, and ligated into pEGFP-Cl (Clontech, Palo Alto; GenBank Accession 
5 number U55763) digested with Hind3and BamHl. This produces an EGFP-VASP fusion 
(SEQ ID NO: 124 &125) under the control of a CMV promoter. 

The resulting plasmid was transfected into CHO cells expressing the human insulin 
receptor using the calcium-phosphate transfection method. Prior to experiments, cells were 
grown in 8 well Nunc chambers and starved overnight in medium without FCS. 

10 Experiments are performed in a microscope setup as described in example 1. 

10% FCS was added to the cells and images were collected. The EGFP-VASP fusion was 
redistributed from a somewhat even distribution near the periphery into more localised 
structures, identified as focal adhesion points (Fig. 11). 

A large number of further GFP fusions have been made or are in the process of being 
15 made, as apparent from the following Examples 7-22 which also suggest suitable host cells 
and substances for activation of the cellular signalling pathways to be monitored and 
analyzed. 

EXAMPLE 7 

20 Probe for detection of actin redistribution. 

Useful for monitoring signalling pathways involving rearrangement or formation of actin 
filaments, e.g. to identify compounds which modulate the activity of pathways leading to 
cytoskeletal rearrangements in living cells. 

Actin is a component of cytoskeletal structures, which redistributes in response to very 
25 many cellular signals. 

The actin binding domain of the human alpha-actinin gene (GenBank Accession number: 
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XI 5804) was amplified using PCR according to standard protocols with primers ABD-top 
(SEQ ID NO:90) and ABD-bottom/-stop (SEQ ID NO:91). The PCR product was digested 
with restriction enzymes Hind3 and BamHl, and ligated into pEGFP-Nl (Clontech, Palo 
Alto; GenBank Accession number U55762) digested with Hind3 and BamHl. This 
5 produced an actin-binding-domain-EGFP fusion (SEQ ID NO: 128 &129) under the control 
of a CMV promoter. 

The resulting plasmid was transfected into CHO cells expressing the human insulin 
receptor. Cells were stimulated with insulin that caused the actin binding domain-EGFP 
probe to become redistributed into morphologically distinct membrane-associated 
10 structures. 



EXAMPLE 8 

Probes for detection of p38 redistribution. 

Useful for monitoring signalling pathways responding to various cellular stress situations, 
15 e.g. to identify compounds which modulate the activity of the pathway in living cells, or as 
a counterscreen. 

p38, a serine/threonine protein kinase, is a component of a stress-induced signalling 
pathway which is activated by many types of cellular stress, e.g. TNFalpha, anisomycin, 
UV and mitomycin C. 

20 a) The human p38 gene (GenBank Accession number: L35253) was amplified using PCR 
according to standard protocols with primers p38-top (SEQ ID NO: 14) and p38- 
bottomAfstop (SEQ ID NO: 16). The PCR product was digested with restriction 
enzymes Xhol and BamHl, and ligated into pEGFP-Cl (Clontech, Palo Alto; GenBank 
Accession number U55763) digested with Xhol and BamHl. This produced an EGFP- 

25 p38 fusion (SEQ ID NO:46 & 47) under the control of a CMV promoter. 

b) The hu man p38 gene (GenBank Accession number: L35253) was amplified using PCR 
according to standard protocols with primers p38-top (SEQ ID NO: 13) and p38- 
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bottom/-stop (SEQ ID NO: 15) . The PCR product was digested with restriction 
enzymes Xhol and BamHl, and ligated into pEGFP-Nl (Clontech, Palo Alto; GenBank 
Accession number U55762) digested with Xhol and BamHl. This produced a p38- 
EGFP fusion (SEQ ID NO:64 & 65) under the control of a CMV promoter. 

5 The resulting plasmids are transfected into a suitable cell line, e.g. HEK293, in which the 
EGFP-p38 probe and/or the p38-EGFP probe should change its cellular distribution from 
predominantly cytoplasmic to nuclear within minutes in response to activation of the 
signalling pathway with e.g. anisomycin. 

10 EXAMPLE 9 

Probes for detection of Jnkl redistribution. 

Useful for monitoring signalling pathways responding to various cellular stress situations, 
e.g. to identify compounds which modulate the activity of the pathway in living cells, or as 
a counterscreen. 

15 Jnkl, a serine/threonine protein kinase, is a component of a stress-induced signalling 

pathway different from the p38 described above, though it also is activated by many types 
of cellular stress, e.g. TNFalpha, anisomycin and UV. 

a) The human Jnkl gene (GenBank Accession number: L26318) was amplified using PCR 
according to standard protocols with primers Jnk-top (SEQ ID NO: 17) and Jnk- 

20 bottom/+stop (SEQ ID NO: 19) . The PCR product was digested with restriction 

enzymes Xhol and BamHl, and ligated into pEGFP-Cl (Clontech, Palo Alto; GenBank 
Accession number U55763) digested with Xhol and BamHl. This produced an EGFP- 
Jnkl fusion (SEQ ID NO:44 &45) under the control of a CMV promoter. 

b) The human Jnkl gene (GenBank Accession number: L26318) was amplified using PCR 
25 according to standard protocols with primers Jnk-top (SEQ ID NO: 17) and Jnk- 

bottorrV-stop (SEQ ID NO: 18) . The PCR product was digested with restriction 
enzymes Xhol and BamHl, and ligated into pEGFP-Nl (Clontech, Palo Alto; GenBank 



22132DK1 Appendix A 



22129DK1 



59 



Accession number U55762) digested with Xhol and BamHl. This produced a Jnkl- 
EGFP fusion (SEQ ID NO:62 &63) under the control of a CM V promoter. 

The resulting plasmids are transfected into a suitable cell line, e.g. HEK293, in which the 
EGFP-Jnkl probe and/or the Jnkl-EGFP probe should change its cellular distribution from 
5 predominantly cytoplasmic to nuclear in response to activation of the signalling pathway 
with e.g. anisomycin. 

EXAMPLE 10 

Probes for detection of PKG redistribution. 

10 Useful for monitoring signalling pathways involving changes in cyclic GMP levels, e.g. to 
identify compounds which modulate the activity of the pathway in living cells. 

PGK, a cGMP-dependent serine/threonine protein kinase, mediates the guanylyl- 
cyclase/cGMP signal. 

a) The human PKG gene (GenBank Accession number: Y07512) is amplified using PCR 
15 according to standard protocols with primers PKG-top (SEQ ID NO:81) and PKG- 

bottom/+stop (SEQ ID NO:83) . The PCR product is digested with restriction enzymes 
Xhol and BamHl, and ligated into pEGFP-Cl (Clontech, Palo Alto; GenBank 
Accession number U55763) digested with Xhol and BamHl. This produces' an EGFP- 
PKG fusion (SEQ ID NO: 134 &135) under the control of a CM V promoter. 

20 b) The human PKG gene (GenBank Accession number: Y07512) is amplified using PCR 
according to standard protocols with primers PKG-top (SEQ ID NO:81) and PKG- 
bottomAstop (SEQ ID NO: 82) . The PCR product is digested with restriction enzymes 
Xhol and BamHl, and ligated into pEGFP-Nl (Clontech, Palo Alto; GenBank 
Accession number U55762) digested with Xhol and BamHl. This produces a PKG- 

25 EGFP fusion (SEQ ID NO: 1 36 & 1 37) under the control of a CM V promoter. 

The resulting plasmids are transfected into a suitable cell line, e.g. A10, in which the 
EGFP-PKG probe and/or the PKG-EGFP probe should change its cellular distribution 
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from cytoplasmic to one associated with cytoskeletal elements within minutes in response 
to treatment with agents which raise nitric oxide (NO) levels. 



EXAMPLE 1 1 

5 Probes for detection of IkappaB kinase redistribution. 

Useful for monitoring signalling pathways leading to NFkappaB activation, e.g. to identify 
compounds which modulate the activity of the pathway in living cells. 

IkappaB kinase, a serine/threonine kinase, is a component of a signalling pathway which is 
activated by a variety of inducers including cytokines, lymphokines, growth factors and 
10 stress. 

a) The alpha subunit of the human IkappaB kinase gene (GenBank Accession number: 
AF009225) is amplified using PCR according to standard protocols with primers IKK- 
top (SEQ ID NO:96) and IKK-bottom/+stop (SEQ ID NO:98). The PCR product is 
digested with restriction enzymes EcoRl and Acc65L and ligated into pEGFP-Cl 

15 (Clontech, Palo Alto; GenBank Accession number U55763) digested with EcoRland 

Acc65L This produces an EGFP-IkappaB-kinase fusion (SEQ ID NO:120 &121) under 
the control of a CMV promoter. 

b) The alpha subunit of the human IkappaB kinase gene (GenBank Accession number: 
AF009225) is amplified using PCR according to standard protocols with primers IKK- 

20 top (SEQ ID NO:96) and IKK-bottom/-stop (SEQ ID NO:97). The PCR product is 
digested with restriction enzymes EcoRl and Acc651, and ligated into pEGFP-Nl 
(Clontech, Palo Alto; GenBank Accession number U55762) digested with EcoRl and 
Acc65I. This produces an IkappaB-kinase-EGFP fusion ( SEQ ID NO: 122 &123) under 
the control of a CMV promoter. 

25 The resulting plasmids are transfected into a suitable cell line, e.g. Jurkat, in which the 
EGFP-IkappaB-kinase probe and/or the IkappaB-kinase-EGFP probe should achieve a 
more cytoplasmic distribution within seconds following stimulation with e.g. TNFalpha. 
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EXAMPLE 12 

Probes for detection of CDK2 redistribution. 

Useful for monitoring signalling pathways of the cell cycle, e.g. to identify compounds 
5 that modulate the activity of the pathway in living cells. 

CDK2, a cyclin-dependent serine/threonine kinase, is a component of the signalling system 
that regulates the cell cycle. 

a) The human CDK2 gene (GenBank Accession number: X61622) is amplified using PCR 
according to standard protocols with primers CDK2-top (SEQ ID NO: 102) and CDK2- 

10 bottomAfstop (SEQ ID NO: 104). The PCR product is digested with restriction enzymes 
Xhol and BamHl, and ligated into pEGFP-Cl (Clontech, Palo Alto; GenBank 
Accession number U55763) digested with Xhol and BamHl. This produces an EGFP- 
CDK2 fusion (SEQ ID NO: 114 &115) under the control of a CMV promoter. 

b) The human CDK2 gene (GenBank Accession number: X61622) is amplified using PCR 
15 according to standard protocols with primers CDK2-top (SEQ ID NO: 102) and CDK2- 

bottom/-stop (SEQ ID NO: 103). The PCR product is digested with restriction enzymes 
Xhol and BamHl, and ligated into pEGFP-Nl (Clontech, Palo Alto; GenBank 
Accession number U55762) digested with Xhol and BamHl. This produces a CDK2- 
EGFP fusion (SEQ ID NO: 1 1 2 & 1 1 3) under the control of a CMV promoter. 

20 The resulting plasmids are transfected into a suitable cell line, e.g. HEK293 in which the 
EGFP-CDK2 probe and/or the CDK2-EGFP probe should change its cellular distribution 
from cytoplasmic in contact-inhibited cells, to nuclear location in response to activation 
with a number of growth factors, e.g. IGF. 



25 EXAMPLE 13 

Probes for detection of Grk5 redistribution. 
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Useful for monitoring signalling pathways involving desensitisation of G-protein coupled 
receptors, e.g. to identify compounds which modulate the activity of the pathway in living 
cells. 

Grk5, a G-protein coupled receptor kinase, is a component of signalling pathways 
5 involving membrane bound G-protein coupled receptors. 

a) The human Grk5 gene (GenBank Accession number: LI 5388) is amplified using PCR 
according to standard protocols with primers Grk5-top (SEQ ID NO:27) and Grk5- 
bottom/+stop (SEQ ID NO:29). The PCR product is digested with restriction enzymes 

10 EcoRl and BamHl, and ligated into pEGFP-Cl (Clontech, Palo Alto; GenBank 

Accession number U55763) digested with EcoRl and BamHl. This produces an EGFP- 
Grk5 fusion (SEQ ID NO:42 &43) under the control of a CMV promoter. 

b) The human Grk5 gene (GenBank Accession number: L15388) is amplified using PCR 
according to standard protocols with primers Grk5-top (SEQ ID NO:27) and Grk5- 

15 bottomAstop (SEQ ID NO:28). The PCR product is digested with restriction enzymes 
EcoRl and BamHl, and ligated into pEGFP-Nl (Clontech, Palo Alto; GenBank 
Accession number U55762) digested with EcoRl and BamHl. This produces a Grk5- 
EGFP fusion (SEQ ID NO:60 &61) under the control of a CMV promoter. 

The resulting plasmids are transfected into a suitable cell line, e.g. HEK293 expressing a 
20 rat dopamine Dl A receptor, in which the EGFP-Grk5 probe and/or the Grk5-EGFP probe 
should change its cellular distribution from predominantly cytoplasmic to peripheral in 
response to activation of the signalling pathway with e.g. dopamine. 

EXAMPLE 14 

25 Probes for detection of Zap70 redistribution. 

Useful for monitoring signalling pathways involving the T cell receptor, e.g. to identify 
compounds which modulate the activity of the pathway in living cells. 
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Zap70, a tyrosine kinase, is a component of a signalling pathway which is active in e.g. T- 
cell differentiation. 

a) The human Zap70 gene (GenBank Accession number: L05148) is amplified using PCR 
according to standard protocols with primers Zap70-top (SEQ ID NO: 105) and Zap70- 
bottom/+stop (SEQ ID NO: 107). The PCR product is digested with restriction enzymes 
EcoRl and BamHl, and ligated into pEGFP-Cl (GenBank Accession number U55763) 
digested with EcoRl and BamHl. This produces an EGFP-Zap70 fusion (SEQ ID 
NO: 108 &109) under the control of a CM V promoter. 

b) The human Zap70 gene (GenBank Accession number: LOS 148) is amplified using PCR 
according to standard protocols with primers Zap70-top (SEQ ID NO: 105) and Zap70- 
bottonV-stop (SEQ ID NO: 106). The PCR product is digested with restriction enzymes 
EcoRl and BamHl, and ligated into pEGFP-Nl (Clontech, Palo Alto; GenBank 
Accession number U55762) digested with EcoRl and BamHl. This produces a Zap70- 
EGFP fusion (SEQ ID NO:l 10 &1 1 1) under the control of a CM V promoter. 

The resulting plasmids are transfected into a suitable cell line, e.g. Jurkat, in which the 
EGFP-Zap70 probe and/or the Zap70-EGFP probe should change its cellular distribution 
from cytoplasmic to membrane-associated within seconds in response to activation of the 
T cell receptor signalling pathway with e.g. antibodies to CD3epsilon. 

EXAMPLE 15 

Probes for detection of p85 redistribution. 

Useful for monitoring signalling pathways involving PI-3 kinase, e.g. to identify 
compounds which modulate the activity of the pathway in living cells. 

p85alpha is the regulatory subunit of PI3-kinase which is a component of many pathways 
involving membrane-bound tyrosine kinase receptors and G-protein-coupled receptors. 

a) The human p85alpha gene (GenBank Accession number: M61906) was amplified using 
PCR according to standard protocols with primers p85-top-C (SEQ ID NO:22) and p85- 
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bottomAfstop (SEQ ID NO:23) . The PCR product was digested with restriction 
enzymes Bgl2 and BamHl, and ligated into pEGFP-Cl (Clontech, Palo Alto; GenBank 
Accession number U55763) digested with Bgl2 and BamHl. This produced an EGFP- 
p85alpha fusion (SEQ ID NO:48 &49) under the control of a CMV promoter. 

5 b) The human p85alpha gene (GenBank Accession number: M61906) was amplified using 
PCR according to standard protocols with primers p85-top-N (SEQ ID NO:20) and p85- 
bottomAstop (SEQ ID NO:21) . The PCR product was digested with restriction 
enzymes EcoRl and BamHl, and ligated into pEGFP-Nl (Clontech, Palo Alto; 
GenBank Accession number U55762) digested with EcoRl and BamHl. This produced 

10 a p85alpha-EGFP fusion (SEQ ID NO:66 &67) under the control of a CMV promoter. 

The resulting plasmids are transfected into a suitable cell line, e.g. CHO expressing the 
human insulin receptor, in which the EGFP-p85 probe and/or the p85-EGFP probe may 
change its cellular distribution from cytoplasmic to membrane-associated within minutes 
in response to activation of the receptor with insulin. 

15 

EXAMPLE 16 

Probes for detection of protein-tyrosine phosphatase redistribution. 

Useful for monitoring signalling pathways involving tyrosine kinases, e.g. to identify 
compounds which modulate the activity of the pathway in living cells. 

20 Protein-tyrosine phosphatase 1C, a tyrosine-specific phosphatase, is an inhibitory 
component in signalling pathways involving e.g. some growth factors. 

a) The human protein-tyrosine phosphatase 1C gene (GenBank Accession number: 

X62055) is amplified using PCR according to standard protocols with primers PTP-top 
(SEQ ID NO:99) and PTP-bottomAfstop (SEQ ID NO: 101). The PCR product is 
25 digested with restriction enzymes Xhol and EcoRl, and ligated into pEGFP-Cl 

(Clontech, Palo Alto; GenBank Accession number U55763) digested with Xhol and 
EcoRl. This produces an EGFP-PTP fusion (SEQ ID NO: 1 16 & 1 17) under the control 



22132DK1 Appendix A 



22129DK1 



65 



of a CMV promoter. 

b) The human protein-tyrosine phosphatase 1C gene (GenBank Accession number: 

X62055) is amplified using PCR according to standard protocols with primers PTP-top 
(SEQ ID NO:99) and PTP-bottom/-stop (SEQ ID NO: 100). The PCR product is 
5 digested with restriction enzymes Xhol and EcoRl, and ligated into pEGFP-Nl 

(Clontech, Palo Alto; GenBank Accession number U55762) digested with Xhol and 
EcoRl. This produces a PTP-EGFP fusion (SEQ ID NO:l 18 & 1 19) under the control 
of a CMV promoter. 

The resulting plasmids are transfected into a suitable cell line, e.g. MCF-7 in which the 
10 EGFP-PTP probe and/or the PTP-EGFP probe should change its cellular distribution from 
cytoplasm to the plasma menbrane within minutes in response to activation of the growth 
inhibitory signalling pathway with e.g. somatostatin. 

EXAMPLE 17 

15 Probes for detection of Smad4 redistribution. 

Useful for monitoring signalling pathways involving most members of the transforming 
growth factor-beta family, e.g. to identify compounds which modulate the activity of the 
pathway in living cells. 

Smad4, a signal transducer, is a common component of signalling pathways induced by 
20 various members of the TGFbeta family of cytokines. 

a) The human Smad4 gene (GenBank Accession number: U44378) was amplified using 
PCR according to standard protocols with primers Smad4-top and Smad4-bottom/+stop 
(SEQ ID NO:35) . The PCR product was digested with restriction enzymes EcoRl and 
BamHl, and ligated into pEGFP-Cl (Clontech, Palo Alto; GenBank Accession number 

25 U55763) digested with EcoRl and BamHl. This produce an EGFP-Smad4 fusion (SEQ 
ID NO:52 & 53) under the control of a CMV promoter. 

b) The human Smad4 gene (GenBank Accession number: U44378) was amplified using 
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PCR according to standard protocols with primers Smad4-top (SEQ ID NO:33) and 
Smad4-bottom/-stop (SEQ ID NO:34). The PCR product was digested with restriction 
enzymes EcoRl and BamHl, and ligated into pEGFP-Nl (Clontech, Palo Alto; 
GenBank Accession number U55762) digested with EcoRl and BamHl. This produced 
5 a Smad4-EGFP fusion (SEQ ID NO:76 & 77) under the control of a CMV promoter. 

The resulting plasmids are transfected into a cell line, e.g. HEK293 in which the EGFP- 
Smad4 probe and/or the Smad4-EGFP probe should change its cellular distribution within 
minutes from cytoplasmic to nuclear in response to activation of the signalling pathway 
with e.g. TGFbeta. 

10 

EXAMPLE 18 

Probes for detection of StatS redistribution. 

Useful for monitoring signalling pathways involving the activation of tyrosine kinases of 
the Jak family, e.g. to identify compounds that modulate the activity of the pathway in 
15 living cells. 

Stat5, signal transducer and activator of transcription, is a component of signalling 
pathways that are induced by e.g. many cytokines and growth factors. 

a) The human StatS gene (GenBank Accession number: L41 142) was amplified using 
20 PCR according to standard protocols with primers Stat5-top (SEQ ID NO:30) and 

Stat5-bottom/+stop (SEQ ID NO:32). The PCR product was digested with restriction 
enzymes Bgl2 and Acc65I, and ligated into pEGFP-Cl (Clontech; Palo Alto; GenBank 
Accession number U55763) digested with Bgl2 and Acc65I. This produced an EGFP- 
StatS fusion (SEQ ID NO:54 & 55) under the control of a CMV promoter. 

25 b) The human StatS gene (GenBank Accession number: L41 142) was amplified using 
PCR according to standard protocols with primers Stat5-top (SEQ ID NO:30) and 
Stat5-bottom/-stop (SEQ ID NO:331). The PCR product was digested with restriction 
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enzymes Bgl2 and Acc65I, and ligated into pEGFP-Nl (Clontech, Palo Alto; GenBank 
Accession number U55762) digested with Bgl2 and Acc65I. This produced a Stat5- 
EGFP fusion (SEQ ID NO:78 & 79) under the control of a CMV promoter. 

The resulting plasmids are transfected into a suitable cell line, e.g. MIN6 in which the 
5 EGFP-Stat5 probe and/or the Stat5-EGFP probe should change its cellular distribution 
from cytoplasmic to nuclear within minutes in response to activation signalling pathway 
with e.g. prolactin. 

EXAMPLE 19 

10 Probes for detection of NFAT redistribution. 

Useful for monitoring signalling pathways involving activation of NFAT, e.g. to identify 
compounds which modulate the activity of the pathway in living cells. 

NFAT, an activator of transcription, is a component of signalling pathways involved in e.g. 
immune responses. 

15 a) The human NFAT1 gene (GenBank Accession number: U43342) is amplified using 
PCR according to standard protocols with primers NFAT-top (SEQ ID NO:84) and 
NFAT-bottom/+stop (SEQ ID NO:86). The PCR product is digested with restriction 
enzymes Xhol and EcoRl, and ligated into pEGFP-Cl (Clontech, Palo Alto; GenBank 
Accession number U55763) digested with Xhol and EcoRl. This produces an EGFP- 

20 NFAT fusion (SEQ ID NO: 1 30 & 1 3 1 ) under the control of a CMV promoter. 

b) The human NFAT gene (GenBank Accession number: U43342) is amplified using PCR 
according to standard protocols with primers NFAT-top (SEQ ID NO:84) and NFAT- 
bottomAstop (SEQ ID NO:85). The PCR product is digested with restriction enzymes 
Xhol and EcoRl, and ligated into pEGFP-Nl (Clontech, Palo Alto; GenBank 
25 Accession number U55762) digested with Xhol and EcoRl. This produces an NFAT- 
EGFP fusion (SEQ ID NO: 132 & 133) under the control of a CMV promoter. 

The resulting plasmids are transfected into a suitable cell line, e.g. Jurkat, in which the 



22132DK1 Appendix A 



22129DK1 



68 



EGFP-NFAT probe and/or the NFAT-EGFP probe should change its cellular distribution 
from cytoplasmic to nuclear within minutes in response to activation of the signalling 
pathway with e.g. antibodies to CD3epsilon. 



5 EXAMPLE 20 

Probes for detection of NFkappaB redistribution. 

Useful for monitoring signalling pathways leading to activation of NFkappaB, e.g. to 
identify compounds which modulate the activity of the pathway in living cells. 

NFkappaB, an activator of transcription, is a component of signalling pathways that are 
10 responsive to a varity of inducers including cytokines, lymphokines, and some 
immunosuppressive agents. 

a) The human NFkappaB p65 subunit gene (GenBank Accession number: M62399) is 
amplified using PCR according to standard protocols with primers NFkappaB-top (SEQ 
ID NO:87) and NFkappaB-bottom/+stop (SEQ ID NO:89). The PCR product is 

15 digested with restriction enzymes Xhol and BamHl, and ligated into pEGFP-Cl 

(Clontech, Palo Alto; GenBank Accession number U55763) digested with Xhol and 
BamHl. This produces an EGFP-NFkappaB fusion (SEQ ID NO: 142 & 143) under the 
control of a CMV promoter. 

b) The human NFkappaB p65 subunit gene (GenBank Accession number: M62399) is 

20 amplified using PCR according to standard protocols with primers NFkappaB-top (SEQ 

ID NO:87) and NFkappaB-bottom/-stop (SEQ ID NO:88). The PCR product is digested 
with restriction enzymes Xhol and BamHl, and ligated into pEGFP-Nl (Clontech, Palo 
Alto; GenBank Accession number U55762) digested with Xhol and BamHl. This 
produces an NFkappaB-EGFP fusion (SEQ ID NO:140 & 141) under the control of a 

25 CMV promoter. 

The resulting plasmids are transfected into a suitable cell line, e.g. Jurkat, in which the 
EGFP-NFkappaB probe and/or the NFkappaB-EGFP probe should change its cellular 
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distribution from cytoplasmic to nuclear in response to activation of the signalling pathway 
with e.g. TNFalpha. 

EXAMPLE 21 

5 Probe for detection of RhoA redistribution. 

Useful for monitoring signalling pathways involving RhoA, e.g. to identify compounds 
which modulate the activity of the pathway in living cells. 

RhoA, a small GTPase, is a component of many signalling pathways, e.g. LPA induced 
cytoskeletal rearrangements. 

10 The human RhoA gene (GenBank Accession number: L25080) was amplified using PCR 
according to standard protocols with primers RhoA-top (SEQ ID NO:92) and RhoA- 
bottomZ+stop (SEQ ID NO:93). The PCR product was digested with restriction enzymes 
Hind3 and BamHl, and ligated into pEGFP-Cl (Clontech, Palo Alto; GenBank Accession 
number U55763) digested with Hind3and BamHl. This produced an EGFP-RhoA fusion 

15 (SEQ ID NO: 126 &127) under the control of a CMV promoter. 

The resulting plasmid is transfected into a suitable cell line, e.g. Swiss3T3, in which the 
EGFP-RhoA probe should change its cellular distribution from a reasonably homogenous 
to a peripheral distribution within minutes of activation of the signalling pathway with e.g. 
LPA. 

20 

EXAMPLE 22 

Probes for detection of PKB redistribution. 

Useful for monitoring signalling pathways involving PKB e.g. to identify compounds 
which modulate the activity of the pathway in living cells. 

25 PKB, a serine/threonine kinase, is a component in various signalling pathways, many of 
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which are activated by growth factors. 

a) The human PKB gene (GenBank Accession number: M63167) is amplified using PCR 
according to standard protocols with primers PKB-top (SEQ ID NO:36) and PKB- 

5 bottom/+stop (SEQ ID NO:80). The PCR product is digested with restriction enzymes 
Xhol and BamHl, and ligated into pEGFP-Cl (Clontech, Palo Alto; GenBank 
Accession number U55763) digested with Xhol and BamHl. This produces an EGFP- 
PKB fusion (SEQ ID NO: 138 & 139) under the control of a CMV promoter. 

b) The human PKB gene (GenBank Accession number: M63167) was amplified using 

10 PCR according to standard protocols with primers PKB-top (SEQ ID NO:36) and PKB- 
bottomAstop (SEQ ID NO:37) . The PCR product was digested with restriction 
enzymes Xhol and BamHl, and ligated into pEGFP-Nl (Clontech, Palo Alto; GenBank 
Accession number U55762) digested with Xhol and BamHl. This produced a PKB- 
EGFP fusion (SEQ ID NO;70 &71) under the control of a CMV promoter. 

15 The resulting plasmids are transfected into a suitable cell line, e.g. CHO expressing the 
human insulin receptor, in which the EGFP-PKB probe and/or the PKB-EGFP probe 
cycles between cytoplasmic and membrane locations during the activation-deactivation 
process following addition of insulin. The transition should be apparent within minutes. 

20 EXAMPLE 23 

Measurement of the real-time redistribution of protein kinase C a isoform-GFP 
fusion (PKCa-GFP) in response to carbamylcholine stimulation of the muscarinic Ml 
receptor; 96 parallel redistribution measurements in microtiter plates. 

BHK cells were stably expressing a recombinant human muscarinic typ 1 receptor, under 
25 the selection with 500 |ig/ml Methotrexate, and also a PKCa-GFP construct (KaA 048), 
under the selection of 500 nM Zeocin. The cells were grown in 96-well plates (Packard 
ViewPlate, black with transparent bottom), washed and preincubated in a Hank's Buffered 
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Salt solution (HBSS) without phenol red, with 20 mM HEPES and 5.5 mM glucose. 

The plate was measured in a FLIPR™ (Fluorescence Imaging Plate Reader) from 
Molecular Devices. The 488 nm emission line from an argon ion laser, run at between 0.4 
and 0.8 W output, was used to excite fluorescence form the GFP. Emission wavelengths 
5 were collected through a 5 1 0 to 565 nm band pass filter. 

The cells were challenged with three doses of carbamylcholine, an Ml receptor agonist 
known from previous studies to give a microscopically detectable redistribution of the 
PKCa-GFP construct [(Almholt et ah 1997)]. Measurements were made every 10 seconds 
for 5 minutes. After data handling including normalisation of baseline fluorescence for the 

10 different wells, background subtraction and averaging the 6 wells used for each 
concentration the data presented in figure 14 were obtained. It can clearly be seen (Fig 14) 
that carbamylcholine gave a time- and dose-dependent, and transient, decrease in 
fluorescence very similar to the time- and dose-dependent profile seen in microscopic 
fluorescence measurements [(see Almholt et ah 1997)]. This experiment was repeated 

15 twice on the same batch of cells with similar results. 

EXAMPLE 24 

Measurement of the real-time redistribution of cyclic-AMP dependent protein kinase 
catalytic subuit-GFP fusion (C-GFP LT ) in response to forskolin stimulation of the 
20 adenylate cyclase; 96 parallel redistribution measurements in microtiter plates. 

CHO cells were stably transfected with hybrid DNA for the PKA catalytic subunit- 
F64L+S65T GFP (C-GFP LT ) fusion protein, and were typically under continuous selection 
with 1000 |ig/ml zeocin (Invitrogen). The cells were grown without selection for 2 days in 
96-well plates (Packard ViewPlate, black with transparent bottom), washed and 
25 preincubated in a Hank's Buffered Salt solution (HBSS) without phenol red, with 20 mM 
HEPES and 5.5 mM glucose. 

The plate was measured in a FLIPR™ (Fluorescence Imaging Plate Reader) from 
Molecular Devices. The 488 nm emission line from an argon ion laser, run at between 0.4 
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and 0.8 W output, was used to excite fluorescence from the GFP. Emission wavelengths 
were collected through a 510 to 565 nm band pass filter. 

The cells were challenged with three doses of forskolin (Fig 15), an adenylate cyclase 
agonist known from previous studies to give a microscopically detectable redistribution of 

5 the C-GFP LT construct [(Almholt et al 1998)]. Measurements were made every 10 seconds 
for over 6 minutes from the point of addition of forskolin. After data handling including 
normalisation of baseline fluorescence for the different wells, background subtraction and 
averaging the 6 wells used for each concentration the data presented below were obtained. 
It can clearly be seen in figure 15 that forskolin gave a time- and dose-dependent decrease 

10 in fluorescence very similar to the time- and dose-dependent profile seen in microscopic 
fluorescence measurements [(see Almholt et al 1998)]. This experiment was repeated 
twice on the same batch of cells with similar results. 

EXAMPLE 25 

15 Measurement of the redistribution response of cyclic-AMP dependent protein kinase 
catalytic subuit-GFP fusion (C-GFP LT ) after forskolin stimulation of the adenylate 
cyclase; measurement of the change in total fluorescence upon permeabilisation of 
agonist-treated cells. 

CHO cells were stably transfected with hybrid DNA for the PKA catalytic subunit- 
20 F64L+ S65T GFP (C-GFP LT ) fusion protein, and were typically under continuous selection 
with 1000 jig/ml zeocin (Invitrogen). For the experiments reported here, cells were grown 
without selection to 90% confluence in 8-well tissue culture-treated Lab-Tek® chambered 
coverglass units (chambers, obtained from Nunc, Inc. Illinois, USA). Immediately prior to 
the experiment growth medium was washed from the cells and replaced with 200 jil 
25 HEPES buffer per well. 

For the results reported here, chambers were measured using a cooled CCD camera 
(KAF1400 chip, Photometries Ltd., USA) attached to an inverted microscope (Diaphot 
300, Nikon, Japan) equipped with a x40 oil-immersion Fluar lens, NA 1.4. Cells were 
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illuminated with 450-490 nm light from a 50 W HBO lamp, and emitted light collected 
between 510-560 nm. 

The cells were challenged with four doses of forskolin, an adenylate cyclase agonist 
known from previous studies to give a microscopically detectable redistribution of the C- 
5 GFP LT construct [(Almholt et al 1998)]. Images were collected at 10-second intervals for a 
period of 10 minutes for each treatment. Six minutes after the addition of forskolin or 
buffer control, Triton-XlOO was added to a final concentration of 0.1%. The detergent 
releases freely mobile C-GFP LT from the cells. The change in fluorescence resulting from 
this loss was measured after 1 minute of equilibration. After data handling including 

10 background subtraction and normalisation to pre-detergent values, the data presented in 
figure 16 were obtained. Permeabilisation caused decreases in fluorescence, the magnitude 
of which were dependent on the forskolin treatments. The dose-dependent profile for 
forskolin activation of the cAMP system as revealed by this method was very similar to 
that registered by other methods (see Almholt et al 1998). This experiment was repeated 

15 twice on the same batch of cells with similar results. 

EXAMPLE 26 

Probe for detection of PKCbeta2 redistribution. 

Useful for monitoring signalling pathways involving protein kinase C, e.g. for identifying 
20 compounds which modulate the activity of the pathway in living cells. 

PKCbeta2, a serine/threonine protein kinase, is closely related to PKCalpha but not 
identical; it is a component of a signalling pathway that is activated by elevation of 
intracellular calcium concomitant with an increase in diacylglycerol species. 

a) The human PKCbeta2 gene (GenBank Accession number: X07109) was amplified using 
25 PCR according to standard protocols with primers PKCbeta2-top (SEQ ID NO: 1 62) and 
PKCbeta2-bottom (SEQ ID NO: 163). The PCR product was digested with restriction 
enzymes Xhol and BamHl, and ligated intopEGFP-Nl (Clontech, Palo Alto; GenBank 
Accession number U55762) digested with Xhol and BamHl. This produces a PKCbeta2- 



22132DK1 Appendix A 



22129DK1 



74 



EGFP fusion (SEQ ID NO: 146 & 147) under the control of a CMV promoter. 

The resulting plasmids are transfected into BHK cells transfected with a human muscarinic 
acetylcholine receptor type ML The cells are grown under standard conditions. The 
fluorescence of the cells is recorded as explained in example 3. Addition of l|iM -IOOjiM 
5 carbachol causes a transient redistribution of fluorescence within the cells whereby it 
changes from a cytosolic location to the plasma membrane. 

EXAMPLE 27 

Probes for detection of PDE4D redistribution. 

10 Useful for monitoring signalling pathways involving Protein Kinase A, e.g. to identify 
compounds which modulate the activity of the pathway in living cells. 

PDE4D3, PDE4D4 and PDE4D5 are closely related splicing variants of PDE4D, a cAMP 
dependent phosphodiesterase. They are components of signalling pathways which involves 
cAMP. 

15 The human PDE4D3, PDE4D4 and PDE4D5 genes (GenBank Accession numbers: 

L20970, L20969 and AFO 12073) are amplified using PCR according to standard protocols 
with the common bottom primer PDE4D-bottom (SEQ ID NO: 159) and PDE4D3-top 
(SEQ ID NO: 156), PDE4D4-top (SEQ ID NO: 157) and PDE4D5-top respectively (SEQ 
ID NO: 158) The PCR products are digested with restriction enzymes Hind3 and EcoRl, 

20 and ligated into pEGFP-Nl (Clontech, Palo Alto; GenBank Accession number U55762) 
digested with Hind3 and EcoRL This produces a PDE4D3-EGFP fusion (SEQ ID NO: 154 
& 155), a PDE4D4-EGFP fusion (SEQ ID NO:150 & 151) and a PDE4D5-EGFP fusion 
(SEQ ID NO: 148 & 149), all three under the control of a CMV promoter. 

The resulting plasmids are transfected into MVLEC cells. The cells are grown under 
25 standard conditions. The fluorescence of the cells is recorded as explained in example 3. 
Addition of test compounds may cause a redistribution of fluorescence within the cells 
from an organised cytosolic distribution to a more random one. 
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EXAMPLE 28 

Probes for detection of PDE5 redistribution. 

Useful for monitoring signalling pathways involving Protein Kinase G, e.g. to identify 
5 compounds which modulate the activity of the pathway in living cells. 

PDE5 is a cGMP specific phosphodiesterase. It is a component of a signalling pathway 
which is activated by e.g. nitric oxide. 

a) The human PDE5 gene (GenBank Accession numbers: AJ004865) is amplified using 
PCR according to standard protocols with primers PDE5-top (SEQ ID NO: 160) and PDE5- 
10 bottom (SEQ ID NO: 161). The PCR product is digested with restriction enzymes EcoRl 
and Acc65I, and ligated into pEGFP-Nl (Clontech, Palo Alto; GenBank Accession 
number U55762) digested with EcoRl and Acc65I. This produces a PDE5-EGFP fusion 
(SEQ ID NO 144 & 145) under the control of a CMV promoter. 

The resulting plasmids are transfected into e.g. A 10 cells. The cells are grown under 
15 standard conditions. The fluorescence of the cells is recorded as explained in example 3. 
Addition of test compounds may cause a redistribution of fluorescence within the cells 
from an organized cytosolic distribution to a more random one. 

EXAMPLE 29 

20 Probe for detection of Ikappa-kinase redistribution. 

The human IKKbeta (GenBank Acc. No. AF031416) is amplified using PCR according to 
standard protocols with primers IKKbeta-top (SEQ ID NO: 164) and IKKbeta-bottom 
(SEQ ID NO: 165). The PCR product is digested with restriction enzymes Hind3 and 
Acc65I, and ligated into pEGFP-Nl (Clontech, Palo Alto; GenBank Accession number 
25 U55762) digested with Hind3 and Acc65I. This produces a IKKbeta-EGFP fusion (SEQ 
ID NO 1 52 & 153) under the control of a CMV promoter. 
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EXAMPLE 30 

Construction of catalytically inactive Erkl probes. 

A catalytically inactive probe has the advantage that it interferes less with the normal 
5 physiology of the cell while retaining its ability to report on activation of a cellular 
signalling pathway by redistribution. 

The Erkl probes described above in Example 3 were subjected to site specific mutagenesis 
which specifically replaced the lysine at amino acid residue number 71 in the native Erkl 
sequence with arginine. This mutation is known to inactivate the catalytic activity of Erkl. 
10 The redistribution patterns of the inactive Erkl probes were identical to the original Erkl 
probes, i.e. they reported on activation of the pathway by redistributing from the cytoplasm 
into the nucleus. The establishment of stable cell lines expressing the probe was facilitated. 
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CLAIMS 

1. A method for extracting quantitative information relating to an influence on a cellular 
response, the method comprising recording variation, caused by the influence on 
mechanically intact or permeabilised living cells, in spatially distributed light emitted 

5 from a luminophore, the luminophore being present in the cells and being capable of 

being redistributed in a manner which is related with the degree of the influence, 
and/or of being modulated by a component which is capable of being redistributed in a 
manner which is related to the degree of the influence, resulting in a modulation of the 
luminescence characteristics of the luminophore, and processing the recorded variation 
10 in the luminescence characteristics to provide quantitative information correlating the 

recorded variation to the degree of the influence on the cellular response. 

2. A method according to claim 1 for extracting quantitative information relating to an 
influence on an intracellular pathway involving redistribution of at least one 
component associated with the pathway, or part thereof, the method comprising 

15 recording the result of the influence on mechanically intact or permeabilised living 

cells, as manifested in spatially distributed light emitted from a luminophore which is 
present in the cells and which is capable of being redistributed, by modulation of the 
pathway, in a manner which is related to the redistribution of the at least one 
component of the intracellular pathway, processing the recorded result to provide 

20 quantitative information correlating the change in the measured property of the light to 

the degree of the influence on the intracellular pathway. 

3. A method according to claim 1 or 2, wherein the quantitative information which is 
indicative of the degree of the cellular response to the influence or the result of the 
influence on the intracellular pathway is extracted from the recorded variation 

25 according to a predetermined calibration based on responses or results, recorded in the 

same manner, to known degrees of a relevant specific influence. 

4. A method according to any of claims 1-3, wherein the influence comprises contact 
between the mechanically intact or permeabilised living cells and a chemical substance 
and/or incubation of the mechanically intact or permeabilised living cells with a 

30 chemical substance. 
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5. A method according to any of claims 1-4, wherein the influence is a substance whose 
effect on an intracellular pathway is to be determined. 

6. A method according to any of claims 1-5, wherein the cells comprise a group of cells 
contained within a spatial limitation. 

5 7. A method according to any of claims 1-5, wherein the cells comprise multiple groups 
of cells contained within multiple spatial limitations. 

8. A method according to any of claims 1-7, wherein the cells comprise multiple groups 
of cells that are qualitatively the same but are subjected to different influences. 

9. A method according to any of claims 1-7, wherein the cells comprise multiple groups 
10 of cells that are qualitatively different but are subjected to the same influence. 

10. A method according to any of claims 1-9, wherein the recording is performed by means 
of a detector capable of measuring total luminescence in a non-spatially resolved 
fashion, the recording comprising a time series of measurements of the total 
luminescence of the cells of one or several of the spatial limitations. 

15 1 1. A method according to claim 10, wherein the signal is measured from individual 
spatial limitations one at a time, the recording being made in the individual spatial 
limitation by means of an apparatus to sequentially position each one of the limitations 
in the field of view of the detector, and repeating the positioning and measuring 
process until all of the spatial limitations have been measured. 

20 1 2. A method according to claim 1 1 , wherein the detector is a photomultiplier tube 
(PMT). 

13. A method according to any of claims 1-9, wherein more than one of the spatial 
limitations are measured simultaneously. 

14. A method according to claim 13, wherein the multiple spatial limitations are measured 
25 simultaneously by means of a one- or two-dimensional array detector, whereby the 

multiple spatial limitations are imaged onto the array detector such that discrete subsets 
of the detecting units (pixels) in the array detector measure the signal from one and 
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only one of the multiple spatial limitations, the signal from any one spatial limitation 
being the combined signal from those pixels that receive the image from one of the 
spatial limitations. 

15. A method according to claim 14, wherein the detector is a linear diode array. 

16. A method according to claim 14, wherein the detector is a video camera. 

17. A method according to claim 14, wherein the detector is a charge transfer device. 

18. A method according to claim 17, wherein the charge transfer device is a charge- 
coupled device. 

19. A method according to any of claims 1-18, wherein the luminophore must be 
illuminated in order to emit light. 

20. A method according to any of claims 13-18, wherein all of the multiple spatial 
limitations are simultaneously illuminated during the measurement operation. 

21. A method according to any of claims 10-18, wherein the individual spatial limitations 
are singly illuminated only during the time period in which they are being measured. 

22. A method according to any of claims 10-18, wherein the illumination is provided by a 
laser which is scanned in a raster fashion over some or all of the spatial limitations 
being measured, the scanning taking place at a rate substantially faster than the 
measurement process such that the illumination appears to the measurement process to 
be continuous in time and spatially uniform over the region being measured. 

23. A method according to any of claims 1-22, wherein the spatial limitations are spatial 
limitations arranged in one or more arrays on a common carrier. 

24. A method according to claim 23, wherein the spatial limitations are wells in a plate of 
microtiter type. 

25. A method according to any of claims 1-22 wherein the spatial limitations are domains 
defined on a substrate on which the cells are present. 
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26. A method according to claim 25 wherein the domains are domains established by the 
presence of the cells on the substrate in a pattern defining the domains. 

27. A method according to claim 25 wherein the domains are domains established by the 
spatial pattern of the influence as it is applied to or contacted with the cells. 

5 28. A method according to any of claims 1-27, wherein the recording is performed at a 
series of points in time, in which the application of the influence occurs at some time 
after the first time point in the series of recordings, the recording being performed, e.g., 
with a predetermined time spacing of from 0.1 seconds to 1 hour, preferably from 1 to 
60 seconds, more preferably from 1 to 30 seconds, in particular from 1 to 10 seconds, 
10 over a time span of from 1 second to 12 hours, such as from 10 seconds to 12 hours, 

e.g., from 10 seconds to one hour, such as from 60 seconds to 30 minutes or 20 
minutes. 

29. A method according to claim 28, wherein the recording is made at two points in time, 
one point being before, and the other point being after the application of the influence. 

15 30. A method according to any of claims 1 -29, wherein the cells are fixed at a point in time 
after the application of the influence at which the response has been predetermined to 
be significant, and the recording is made at an arbitrary later time. 

31. A method according to any of claims 1-30, wherein the luminophore is a luminophore 
that is capable of being redistributed in a manner that is physiologically relevant to the 

20 degree of the influence. 

32. A method according to any of claims 1-30, wherein the luminophore is a luminophore 
which is capable of associating with a component which is capable of being 
redistributed in manner which is physiologically relevant to the degree of the influence. 

33. A method according to any of claims 1-30, wherein the luminophore is a luminophore 
25 which is capable of being redistributed in a manner which is experimentally 

determined to be correlated to the degree of the influence. 

34. A method according to any of claims 1-30, wherein the luminophore is a luminophore 
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which is capable of being redistributed, by modulation of the intracellular pathway, in 
substantially the same manner as the at least one component of the intracellular 
pathway. 

35. A method according to any of claims 1-30, wherein the luminophore is a luminophore 
5 which is capable of being quenched upon spatial association with a component which 

is redistributed by modulation of the pathway, the quenching being measured as a 
decrease in the intensity of the luminescence. 

36. A method according to any of claims 1-30, wherein the variation in spatially 
distributed light emitted by the luminophore is detected by a change in the resonance 

10 energy transfer between the luminophore and another luminescent entity capable of 

delivering energy to the luminophore, each of which has been selected or engineered to 
become part of, bound to or associated with particular components of the intracellular 
pathway, and one of which undergoes redistribution in response to the influence, 
thereby changing the amount of resonance energy transfer, the change in the resonance 

15 energy transfer being measured as a change in the intensity of emission from the 

luminophore. 

37. A method according to any of claims 1-35, wherein the intensity of the light being 
recorded is a function of the fluorescence lifetime, polarisation, wavelength shift, or 
other property which is modulated as a result of the underlying cellular response. 

20 38. A method according to any of claims 1-37, wherein the light to be measured passes 
through a filter which selects the desired component of the light to be measured and 
rejects other components. 

39. A method according to any of claims 2-38, wherein the intracellular pathway is an 
intracellular signalling pathway. 

25 40. A method according to any of claims 1-39, wherein the luminophore is a fluorophore. 

41. A method according to any of claims 1-40, wherein the luminophore is a polypeptide 
encoded by and expressed from a nucleotide sequence harboured in the cells. 
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42. A method according to any of claims 1-41 for detecting intracellular redistribution of a 
biologically active polypeptide affecting intracellular processes upon activation, the 
method comprising 

a) culturing one or more cells containing a nucleotide sequence coding for a hybrid 

5 polypeptide comprising a GFP which is N- or C-terminally tagged, optionally through 

a linker, to a biologically active polypeptide under conditions permitting expression of 
the nucleotide sequence, 

b) modulating the activity of the biologically active polypeptide by incubating the cells 
with a substance having biological activity, and 

10 c) measuring the fluorescence produced by the incubated cells and determining the result 
or variation with respect to the fluorescence, such result or variation being indicative of 
the redistribution of a biologically active polypeptide in said cells. 

43. A method according to claim 42, wherein the luminophore is a hybrid polypeptide 
comprising a fusion of at least a portion of each of two polypeptides one of which 

15 comprises a luminescent polypeptide and the other one of which comprises a 

biologically active polypeptide, as defined herein. 

44. A method according to claim 43, wherein the luminescent polypeptide is a GFP as 
defined herein. 

45. A method according to claim 44, wherein the GFP is selected from the group 

20 consisting of green fluorescent proteins having the F64L mutation as defined herein. 

46. A method according to claim 45, wherein the GFP is a GFP variant selected from the 
group consisting of F64L-GFP, F64L-Y66H-GFP, F64L-S65T-GFP, and EGFP. 

47. A method according to claim 42. wherein the nucleotide sequence is a DNA sequence. 

48. A method according to claims 42-47, wherein the modulation is activation. 
25 49. A method according to claims 42-47, wherein the modulation is deactivation. 

50. A method according to any of claims 1-49, wherein the cells are selected from the 
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group consisting of fungal cells, such as yeast cells; invertebrate cells including insect 
cells; and vertebrate cells, such as mammalian cells. 

5 1 . A method according to claim 50, wherein the mechanically intact or permeabilised 
living cells are mammalian cells which, during the time period over which the 
influence is observed, are incubated at a temperature of 30°C or above, preferably at a 
temperature of from 32°C to 39°C, more preferably at a temperature of from 35°C to 
38°C, and most preferably at a temperature of about 37°C. 

52. A method according to any of claims 1-51, wherein the mechanically intact or 
permeabilised living cells are part of a matrix of identical or non-identical cells. 

53. A method according to any of claims 41-52, wherein the nucleotide sequence has been 
introduced into the cells in the form of a nucleic acid construct coding for a fusion 
polypeptide comprising a biologically active polypeptide that is a component of an 
intracellular signalling pathway, or a part thereof, and a GFP. 

54. A method according to claim 53, wherein the nucleic acid construct is a nucleic acid 
construct coding for a fusion polypeptide comprising a biologically active polypeptide 
that is a component of an intracellular signalling pathway, or a part thereof, and an 
F64L mutant of GFP. 

55. A method according to claim 53 or 54, wherein the nucleic acid construct is a nucleic 
acid construct according to claim 53 or 54, wherein the biologically active polypeptide 
is a protein kinase or a phosphatase. 

56. A method according to claim 53 or 54, wherein the nucleic acid construct is a nucleic 
acid construct according to claim 53 - 55, wherein the GFP is N- or C-terminally 
tagged, optionally via a peptide linker, to the biologically active polypeptide or part 
thereof. 

57. A method according to claim 53 or 54, wherein the nucleic acid construct is a nucleic 
acid construct according to claim 53, 54 or 56, wherein the biologically active 
polypeptide is a transcription factor or a part thereof which changes cellular 
localisation upon activation. 
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58. A method according to claim 53 or 54, wherein the nucleic acid construct is a nucleic 
acid construct according to claim 53, 54 or 56, wherein the biologically active 
polypeptide is a protein, or a part thereof, which is associated with the cytoskeletal 
network and which changes cellular localisation upon activation. 

5 59. A method according to claim 53 or 54, wherein the nucleic acid construct is a nucleic 
acid construct according to any of claims 53-56, wherein the biologically active 
polypeptide is a protein kinase or a part thereof which changes cellular localisation 
upon activation. 

60. A method according to claim 53 or 54, wherein the nucleic acid construct is a nucleic 
10 acid construct according to claim 59, wherein the protein kinase is a serine/threonine 

protein kinase or a part thereof capable of changing intracellular localisation upon 
activation. 

61. A method according to claim 53 or 54, wherein the nucleic acid construct is a nucleic 
acid construct according to claim 59, wherein the protein kinase is a tyrosine protein 

15 kinase or a part thereof capable of changing intracellular localisation upon activation. 

62. A method according to claim 53 or 54, wherein the nucleic acid construct is a nucleic 
acid construct according to claim 59, wherein the protein kinase is a phospholipid- 
dependent serine/threonine protein kinase or a part thereof capable of changing 
intracellular localisation upon activation. 

20 63. A method according to claim 53 or 54, wherein the nucleic acid construct is a nucleic 
acid construct according to claim 59, wherein the protein kinase is a cAMP-dependent 
protein kinase or a part thereof capable of changing cellular localisation upon 
activation. 

64. A method according to claim 53 or 54, wherein the nucleic acid construct is a nucleic 
25 acid construct according to claim 63 which codes for a PKAc-F64L-S65T-GFP fusion. 

65. A method according to claim 53 or 54, wherein the nucleic acid construct is a nucleic 
acid construct according to claim 59, wherein the protein kinase is a cGMP-dependent 
protein kinase or a part thereof capable of changing cellular localisation upon 
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activation. 

66. A method according to claim 53 or 54, wherein the nucleic acid construct is a nucleic 
acid construct according to claim 59, wherein the protein kinase is a calmodulin- 
dependent serine/threonine protein kinase or a part thereof capable of changing cellular 

5 localisation upon activation. 

67. A method according to claim 53 or 54, wherein the nucleic acid construct is a nucleic 
acid construct according to claim 59, wherein the protein kinase is a mitogen-activated 
serine/threonine protein kinase or a part thereof capable of changing cellular 
localisation upon activation. 

68. A method according to claim 53 or 54, wherein the nucleic acid construct is a nucleic 
acid construct according to claim 67, which codes for an ERK1-F64L-S65T-GFP 
fusion. 

69. A method according to claim 53 or 54, wherein the nucleic acid construct is a nucleic 
acid construct according to claim 67, which codes for an EGFP-ERK1 fusion. 

70. A method according to claim 53 or 54, wherein the nucleic acid construct is a nucleic 
acid construct according to claim 59, wherein the protein kinase is a cyclin-dependent 
serine/threonine protein kinase or a part thereof capable of changing cellular 
localisation upon activation. 

71. A method according to claim 53 or 54, wherein the nucleic acid construct is a nucleic 
acid construct according to claim 55 or 56, wherein the biologically active polypeptide 
is a protein phosphatase or a part thereof capable of changing cellular localisation upon 
activation. 

72. A method according to claim 53 -71, wherein the nucleic acid construct is a nucleic 
acid construct which is a DNA construct. 

73. A method according to claim 53 -72, wherein the nucleic acid construct is a nucleic 
acid construct according to any of claims 53-72 wherein the gene encoding GFP is 
derived from Aequorea victoria. 
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74. A method according to claim 73, wherein the nucleic acid construct is a nucleic acid 
construct according to claim 73 in which the gene encoding GFP is the gene encoding 
EGFP as defined herein. 

75. A method according to claim 73, wherein the nucleic acid construct is a nucleic acid 
construct according to claim 73 in which the gene encoding a GFP is a gene encoding a 
GFP variant selected from F64L-GFP, F64L-Y66H-GFP and F64L-S65T-GFP. 

76. A method according to claims 72 and 74, wherein the nucleic acid construct is a DNA 
construct according to claims 72 and 74 or, where applicable, 75, which is a construct 
as identified by any of the DNA sequences shown in SEQ ID NO: 38, 40, 42, 44, 46, 
48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 108, 1 10, 1 12, 1 14, 116, 
118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 
and 152 or is a variant thereof capable of encoding the same fusion polypeptide or a 
fusion polypeptide which is biologically equivalent thereto, as defined herein. 

77. A method comprising a cell containing a nucleic acid construct according to any of 
claims 53-76 and capable of expressing the sequence encoded by the construct. 

78. A method comprising a cell according to claim 77, which is a eukaryotic cell. 

79. A method comprising a cell according to claim 77, which is selected from the group 
consisting of fungal cells, such as yeast cells; invertebrate cells, including insect cells, 
and vertebrate cells, such as mammalian cells. 

80. A method according to any of claims 1-79, as used in a screening program as defined 
herein. 

81. A method according claim 80, wherein the method is a screening program for the 
identification of a biologically active substance as defined herein that directly or 
indirectly affects an intracellular signalling pathway and is potentially useful as a 
medicament, wherein the result of the individual measurement of each substance being 
screened which indicates its potential biological activity is based on measurement of 
the redistribution of spatially resolved luminescence in living cells and which 
undergoes a change in distribution upon activation of an intracellular signalling 
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pathway. 

82. A method according to claim 80, wherein the method is a screening program for the 
identification of a biologically toxic substance as defined herein that exerts its toxic 
effect by interfering with an intracellular signalling pathway, wherein the result of the 
individual measurement of each substance being screened which indicates its potential 
biologically toxic activity is based on measurement of the redistribution of said 
fluorescent probe in living cells and which undergoes a change in distribution upon 
activation of an intracellular signalling pathway. 

83. A method according to any of claims 1-82 wherein a fluorescent probe is used in back- 
tracking of signal transduction pathways as defined herein. 

84. A method according to any of claims 1-83, for treating a condition or disease related to 
the intracellular function of a protein kinase comprising administering to a patient 
suffering from said condition or disease an effective amount of a compound which has 
been discovered by any method. 

85. A compound that modulates a component of an intracellular pathway as defined herein, 
as determined by any method according to any of claims 1-83. 

86. A medical composition comprising a therapeutic amount of a compound identified 
according to any method according to any of claims 1-83. 

87. A method of selectively treating a patient suffering from an ailment which responds to 
medical treatment comprising obtaining a primary cells from said patient, transfecting 
the cells with at least one DNA sequence encoding a fluorescent probe according to 
any of the preceding claims, culturing the cells under conditions permitting the 
expression of said probes and exposing it to an array of medicaments suspected of 
being capable of alleviating said ailment, then comparing changes in fluorescence 
patterns or redistribution patterns of the fluorescent probes in the intact living cells to 
detect the cellular response to the specific medicaments (obtaining a cellular action 
profile), then selecting a medicament(s) based on desired activity and acceptable level 
of side effects and administering an effective amount of said medicament(s) to said 
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patient. 

88. A method according to any of claims 1-83 of identifying a drug target among the g 
of biologically active polypeptides that are components of intracellular signalling 
pathways. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 
(i) APPLICANT: NovoNordisk, Biolmage 

(ii) TITLE OF THE INVENTION: An Improved Method of Detecting Cellular 

Translocation of Biologically Active Polypeptides Using 
Fluorescense Imaging 

(iii) NUMBER OF SEQUENCES: 165 

<iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: NovoNordisk, Biolmage 

(B) STREET: Morkho jbygade 28 

(C) CITY: Soborg 

(D) STATE: DK 

(E) COUNTRY: DENMARK 

(F) ZIP: 2860 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 



(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: , PV&P R 

(B) REGISTRATION NUMBER: 

(C) REFERENCE / DOCKET NUMBER: 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
TTGGACACAA GCTTTGGACA CGGCGCGCCA TGAGTAAAGG AGAAGAACTT TTC 53 
(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



GTCATCTTCT CGAGTCTTAC TCCTGAGGTT TGTATAGTTC ATCCATGCCA TGT 
(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TTGGACACAA GCTTTGGACA CCCTCAGGAT ATGGGCAACG CCGCCGCCGC CAAG 
(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GTCATCTTCT CGAGTCTTTC AGGCGCGCCC AAACTCAGTA AACTCCTTGC CACAC 
(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TTGGACACAA GCTTTGGACA CCCTCAGGAT ATGGCTGACG TTTACCCGGC CAACG 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
GTCATCTTCT CGAGTCTTTC AGGCGCGCCC TACTGCACTT TGCAAGATTG GGTGC 
(2) INFORMATION FOR SEQ ID NO : 7 : 



(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

TTGGACACAA GCTTTGGACA CCCTCAGGAT ATGGCGGCGG CGGCGGCGGC TCCGGGGGGC 
GGGG 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
GTCATCTTCT CGAGTC TTTC AGGCGCGCCC GGGGCCCTCT GGCGCCCCTG GCTGG 
(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 
TAGAATTCAA CCATGGCGGC GGCGGCGGCG 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TAGGATCCCT AGGGGGCCTC CAGCACTCC 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: II: 
TAC TCGAGT A AC C ATGGCGG CGGCGGCGGC G 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
TAGGATCCAT AGATCTGTAT CCTGG 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDr JESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
TAGGATCCTT AAGATCTGTA TCCTGG 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED! JESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
ATC TCGAGGG AAAATGTCTC AGGAGAGG 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
ATGGATCCTC GGACTCCATC TCTTCTTG 

(2) INFORMATION FOR SEQ ID NO: 16: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
ATGGATCCTC AGGACTCCAT CTCTTCTTG 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi} SEQUENCE DESCRIPTION: SEQ ID NO: 17 
GTCTCGAGCC ATCATGAGCA GAAGCAAG 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
GTGGATCCCA CTGCTGCACC TGTGCTA 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 
GTGGATCC TC ACTGC TGC AC C TGTGCTA 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEWtjTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 

CGCGAATTCC GCCACCATGA GTGCTGAGGG GTACCAGTAC 

( 2 ) INFORMATION FOR SEQ ID NO : 2 1 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21 
CGCGGATCCT GTCGCCTCTG CTGTGCATAT AC 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: p85-top-C 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22 

GGGAGATCTA TGAGTGCTGA GGGGTACCAG 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23 
GGGCGGATCC TCATCGCCTC TGCTGTGCAT ATAC 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24 



GTGAATTCGA CCATGTCGTC CATCTTGCCA TTC 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 

GTGGTACCCA TGACATGCTT GAGCAACGCA C 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 32 base pairs 
{ B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
GTGGTACCTT ATGACATGCT TGAGCAACGC AC 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27 
GTGAATTCGT CAATGGAGCT GGAAAACATC G 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 

GTGGATCCCT GCTGCTTCCG GTGGAGTTCG 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 31 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
GTGGATCCCT AGCTGCTTCC GGTGGAGTTC G 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 
GTAGATCTAC CATGGCGGGC TGG ATC CAGG CC 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 
GTGGTACCCA TGAGAGGGAG CCTCTGGCAG A 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 
GTGGTACCTC ATGAGAGGGA GCCTCTGGCA G 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 
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GTGAATTCAA CCATGGACAA TATGTC TATT ACG 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 
GTGGATCCCA GTCTAAAGGT TGTGGGTCTG C 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 
GTGGATCCTC AGTCTAAAGG TTGTGGGTCT GC 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 
GTCTCGAGGC AC C ATGAGCG ACGTGGC 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37 
TGGGATCCGA GGCCGTGCTG CTGGCCG 

(2) INFORMATION FOR SEQ ID NO : 3 8 : 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 1896 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/ KEY : Coding Sequence 

(B) LOCATION: 1 . . .1891 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 48 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 240 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 2 88 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 336 
Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 432 
lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 480 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 



GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 
Gly He Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 
165 170 175 



528 



// 



GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 57 6 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGO ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 67 2 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 720 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT CGA GCT CAA GCT TCG AAT TCA ACC ATG GCG GCG GCG 768 
Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Thr Met Ala Ala Ala 
245 250 255 

GCG GCT CAG GGG GGC GGG GGC GGG GAG CCC CGT AGA ACC GAG GGG GTC 816 
Ala Ala Gin Gly Gly Gly Gly Gly Glu Pro Arg Arg Thr Glu Gly Val 
260 265 270 

GGC CCG GGG GTC CCG GGG GAG GTG GAG ATG GTG AAG GGG CAG CCG TTC 864 
Gly Pro Gly Val Pro Gly Glu Val Glu Met Val Lys Gly Gin Pro Phe 
275 280 285 

GAC GTG GGC CCG CGC TAC ACG CAG TTG CAG TAC ATC GGC GAG GGC GCG 912 
Asp Val Gly Pro Arg Tyr Thr Gin Leu Gin Tyr He Gly Glu Gly Ala 
290 295 300 

TAC GGC ATG GTC AGC TCG GCC TAT GAC CAC GTG CGC AAG ACT CGC GTG 960 
Tyr Gly Met Val Ser Ser Ala Tyr Asp His Val Arg Lys Thr Arg Val 
305 310 315 320 

GCC ATC AAG AAG ATC AGC CCC TTC GAA CAT CAG ACC TAC TGC CAG CGC 1008 
Ala He Lys Lys He Ser Pro Phe Glu His Gin Thr Tyr Cys Gin Arg 
325 330 335 

ACG CTC CGG GAG ATC CAG ATC CTG CTG CGC TTC CGC CAT GAG AAT GTC 1056 
Thr Leu Arg Glu He Gin He Leu Leu Arg Phe Arg His Glu Asn Val 
340 345 350 

ATC GGC ATC CGA GAC ATT CTG CGG GCG TCC ACC CTG GAA GCC ATG AGA 1104 
He Gly He Arg Asp He Leu Arg Ala Ser Thr Leu Glu Ala Met Arg 
355 360 365 

GAT GTC TAC ATT GTG CAG GAC CTG ATG GAG ACT GAC CTG TAC AAG TTG 1152 
Asp Val Tyr He Val Gin Asp Leu Met Glu Thr Asp Leu Tyr Lys Leu 
370 375 380 

CTG AAA AGC CAG CAG CTG AGC AAT GAC CAT ATC TGC TAC TTC CTC TAC 12 00 

Leu Lys Ser Gin Gin Leu Ser Asn Asp His He Cys Tyr Phe Leu Tyr 
385 390 395 400 

CAG ATC CTG CGG GGC CTC AAG TAC ATC CAC TCC GCC AAC GTG CTC CAC 1248 
Gin He Leu Arg Gly Leu Lys Tyr He His Ser Ala Asn Val Leu His 



405 410 415 

CGA GAT CTA AAG CCC TCC AAC CTG CTC AGC AAC ACC ACC TGC GAC CTT 1296 
Arg Asp Leu Lys Pro Ser Asn Leu Leu Ser Asn Thr Thr Cys Asp Leu 
420 425 430 

AAG ATT TGT GAT TTC GGC CTG GCC CGG ATT GCC GAT CCT GAG CAT GAC 1344 
Lys lie Cys Asp Phe Gly Leu Ala Arg lie Ala Asp Pro Glu His Asp 
435 440 445 

CAC ACC GGC TTC CTG ACG GAG TAT GTG GCT ACG CGC TGG TAC CGG GCC 13 92 

His Thr Gly Phe Leu Thr Glu Tyr Val Ala Thr Arg Trp Tyr Arg Ala 
450 455 460 

CCA GAG ATC ATG CTG AAC TCC AAG GGC TAT ACC AAG TCC ATC GAC ATC 1440 
Pro Glu lie Met Leu Asn Ser Lys Gly Tyr Thr Lys Ser lie Asp lie 
465 470 475 480 

TGG TCT GTG GGC TGC ATT CTG GCT GAG ATG CTC TCT AAC CGG CCC ATC 1488 
Trp Ser Val Gly Cys lie Leu Ala Glu Met Leu Ser Asn Arg Pro lie 
485 490 495 

TTC CCT GGC AAG CAC TAC CTG GAT CAG CTC AAC CAC ATT CTG GGC ATC 1536 
Phe Pro Gly Lys His Tyr Leu Asp Gin Leu Asn His lie Leu Gly lie 
500 505 510 

CTG GGC TCC CCA TCC CAG GAG GAC CTG AAT TGT ATC ATC AAC ATG AAG 1584 
Leu Gly Ser Pro Ser Gin Glu Asp Leu Asn Cys lie lie Asn Met Lys 
515 520 525 

GCC CGA AAC TAC CTA CAG TCT CTG CCC TCC AAG ACC AAG GTG GCT TGG 1632 
Ala Arg Asn Tyr Leu Gin Ser Leu Pro Ser Lys Thr Lys Val Ala Trp 
530 535 540 

GCC AAG CTT TTC CCC AAG TCA GAC TCC AAA GCC CTT GAC CTG CTG GAC 1680 
Ala Lys Leu Phe Pro Lys Ser Asp Ser Lys Ala Leu Asp Leu Leu Asp 
545 550 555 560 

CGG ATG TTA ACC TTT AAC CCC AAT AAA CGG ATC ACA GTG GAG GAA GCG 1728 
Arg Met Leu Thr Phe Asn Pro Asn Lys Arg lie Thr Val Glu Glu Ala 
565 570 575 

CTG GCT CAC CCC TAC CTG GAG CAG TAC TAT GAC CCG ACG GAT GAG CCA 1776 
Leu Ala His Pro Tyr Leu Glu Gin Tyr Tyr Asp Pro Thr Asp Glu Pro 
580 585 590 

GTG GCC GAG GAG CCC TTC ACC TTC GCC ATG GAG CTG GAT GAC CTA CCT 1824 
Val Ala Glu Glu Pro Phe Thr Phe Ala Met Glu Leu Asp Asp Leu Pro 
595 600 605 

AAG GAG CGG CTG AAG GAG CTC ATC TTC CAG GAG ACA GCA CGC TTC CAG 1872 
Lys Glu Arg Leu Lys Glu Leu lie Phe Gin Glu Thr Ala Arg Phe Gin 
610 615 620 

CCC GGA GTG CTG GAG GCC C CCTAG 1896 
Pro Gly Val Leu Glu Ala Pro 
625 630 
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(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 631 amino acids 

(B) TYPE: amino acid 

{C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 HO 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Thr Met Ala Ala Ala 

245 250 255 

Ala Ala Gin Gly Gly Gly Gly Gly Glu Pro Arg Arg Thr Glu Gly Val 

260 265 270 

Gly Pro Gly Val Pro Gly Glu Val Glu Met Val Lys Gly Gin Pro Phe 

275 280 285 

Asp Val Gly Pro Arg Tyr Thr Gin Leu Gin Tyr He Gly Glu Gly Ala 

290 295 300 

Tyr Gly Met Val Ser Ser Ala Tyr Asp His Val Arg Lys Thr Arg Val 
305 310 315 320 

Ala He Lys Lys He Ser Pro Phe Glu His Gin Thr Tyr Cys Gin Arg 

325 330 335 

Thr Leu Arg Glu He Gin He Leu Leu Arg Phe Arg His Glu Asn Val 
340 345 350 
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lie Gly He Arg Asp He Leu Arg Ala Ser Thr Leu Glu Ala Met Arg 

355 360 365 

Asp Val Tyr He Val Gin Asp Leu Met Glu Thr Asp Leu Tyr Lys Leu 

370 375 380 

Leu Lys Ser Gin Gin Leu Ser Asn Asp His lie Cys Tyr Phe Leu Tyr 
385 390 395 400 

Gin lie Leu Arg Gly Leu Lys Tyr He His Ser Ala Asn Val Leu His 

405 410 415 

Arg Asp Leu Lys Pro Ser Asn Leu Leu Ser Asn Thr Thr Cys Asp Leu 

420 425 430 

Lys He Cys Asp Phe Gly Leu Ala Arg He Ala Asp Pro Glu His Asp 

435 440 445 

His Thr Gly Phe Leu Thr Glu Tyr Val Ala Thr Arg Trp Tyr Arg Ala 

450 455 460 

Pro Glu lie Met Leu Asn Ser Lys Gly Tyr Thr Lys Ser He Asp He 
465 470 475 480 

Trp Ser Val Gly Cys lie Leu Ala Glu Met Leu Ser Asn Arg Pro He 

485 490 495 

Phe Pro Gly Lys His Tyr Leu Asp Gin Leu Asn His lie Leu Gly lie 

500 505 510 

Leu Gly Ser Pro Ser Gin Glu Asp Leu Asn Cys He lie Asn Met Lys 

515 520 525 

Ala Arg Asn Tyr Leu Gin Ser Leu Pro Ser Lys Thr Lys Val Ala Trp 

530 535 540 

Ala Lys Leu Phe Pro Lys Ser Asp Ser Lys Ala Leu Asp Leu Leu Asp 
545 550 555 560 

Arg Met Leu Thr Phe Asn Pro Asn Lys Arg lie Thr Val Glu Glu Ala 

565 570 575 

Leu Ala His Pro Tyr Leu Glu Gin Tyr Tyr Asp Pro Thr Asp Glu Pro 

580 585 590 

Val Ala Glu Glu Pro Phe Thr Phe Ala Met Glu Leu Asp Asp Leu Pro 

595 600 605 

Lys Glu Arg Leu Lys Glu Leu He Phe Gin Glu Thr Ala Arg Phe Gin 

610 615 620 

Pro Gly Val Leu Glu Ala Pro 
625 630 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1818 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME /KEY: Coding Sequence 

(B) LOCATION: 1. . .1815 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 



ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 
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GTC GAG CTG GAC GGC GAC GTA MC GGC CAC AAG TTC AGC GTG TCC GGC 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 240 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 2 88 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 33 6 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 480 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 720 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT CGA GTA ACC ATG GCG GCG GCG GCG GCG GCG GGC CCG 7 68 

Gly Leu Arg Ser Arg Val Thr Met Ala Ala Ala Ala Ala Ala Gly Pro 



245 250 255 

GAG ATG GTC CGC GGG CAG GTG TTC GAC GTG GGG CCG CGC TAC ACT AAT 816 
Glu Met Val Arg Gly Gin Val Phe Asp Val Gly Pro Arg Tyr Thr Asn 
260 265 270 

CTC TCG TAC ATC GGA GAA GGC GCC TAC GGC ATG GTT TGT TCT GCT TAT 864 
Leu Ser Tyr He Gly Glu Gly Ala Tyr Gly Met Val Cys Ser Ala Tyr 
275 280 285 

GAT AAT CTC AAC AAA GTT CGA GTT GCT ATC AAG AAA ATC AGT CCT TTT 912 
Asp Asn Leu Asn Lys Val Arg Val Ala He Lys Lys He Ser Pro Phe 
290 295 300 

GAG CAC CAG ACC TAC TGT CAG AGA ACC CTG AGA GAG ATA AAA ATC CTA 960 
Glu His Gin Thr Tyr Cys Gin Arg Thr Leu Arg Glu He Lys lie Leu 
305 310 315 320 

CTG CGC TTC AGA CAT GAG AAC ATC ATC GGC ATC AAT GAC ATC ATC CGG 1008 
Leu Arg Phe Arg His Glu Asn He He Gly He Asn Asp lie lie Arg 
325 330 335 

GCA CCA ACC ATT GAG CAG ATG AAA GAT GTA TAT ATA GTA CAG GAC CTC 1056 
Ala Pro Thr He Glu Gin Met Lys Asp Val Tyr lie Val Gin Asp Leu 
340 345 350 

ATG GAG ACA GAT CTT TAC AAG CTC TTG AAG ACA CAG CAC CTC AGC AAT 1104 
Met Glu Thr Asp Leu Tyr Lys Leu Leu Lys Thr Gin His Leu Ser Asn 
355 360 365 

GAT CAT ATC TGC TAT TTT CTT TAT CAG ATC CTG AGA GGA TTA AAG TAT 1152 
Asp His He Cys Tyr Phe Leu Tyr Gin lie Leu Arg Gly Leu Lys Tyr 
370 375 380 

ATA CAT TCA GCT AAT GTT CTG CAC CGT GAC CTC AAG CCT TCC AAC CTC 1200 
He His Ser Ala Asn Val Leu His Arg Asp Leu Lys Pro Ser Asn Leu 
385 390 395 400 

CTG CTG AAC ACC ACT TGT GAT CTC AAG ATC TGT GAC TTT GGC CTT GCC 1248 
Leu Leu Asn Thr Thr Cys Asp Leu Lys lie Cys Asp Phe Gly Leu Ala 
405 410 415 

CGT GTT GCA GAT CCA GAC CAT GAT CAT ACA GGG TTC TTG ACA GAG TAT 1296 
Arg Val Ala Asp Pro Asp His Asp His Thr Gly Phe Leu Thr Glu Tyr 
420 425 430 

GTA GCC ACG CGT TGG TAC AGA GCT CCA GAA ATT ATG TTG AAT TCC AAG 13 44 
Val Ala Thr Arg Trp Tyr Arg Ala Pro Glu lie Met Leu Asn Ser Lys 
435 440 445 

GGT TAT ACC AAG TCC ATT GAT ATT TGG TCT GTG GGC TGC ATC CTG GCA 13 92 
Gly Tyr Thr Lys Ser He Asp He Trp Ser Val Gly Cys He Leu Ala 
450 455 460 

GAG ATG CTA TCC AAC AGG CCT ATC TTC CCA GGA AAG CAT TAC CTT GAC 1440 
Glu Met Leu Ser Asn Arg Pro He Phe Pro Gly Lys His Tyr Leu Asp 
465 470 475 480 
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CAG CTG AAT 
Gin Leu Asn 



CTG AAT TGT 
Leu Asn Cys 

CCG CAC AAA 
Pro His Lys 
515 

TCC AAA GCT 
Ser Lys Ala 
530 

AAG AGG ATT 
Lys Arg He 
545 

TAT TAT GAC 
Tyr Tyr Asp 

GAC ATG GAG 
Asp Met Glu 



TTT GAA GAG 
Phe Glu Glu 
595 



CAC ATC CTG 
His He Leu 
485 

ATA ATA AAT 
He He Asn 
500 

AAT AAG GTG 
Asn Lys Val 

CTG GAT TTA 
Leu Asp Leu 

GAA GTT GAA 
Glu Val Glu 
550 

CCA AGT GAT 
Pro Ser Asp 
565 

CTG GAC GAC 
Leu Asp Asp 
580 

ACT GCT CGA 
Thr Ala Arg 



GGT ATT CTT 
Gly He Leu 

TTA AAA GCT 
Leu Lys Ala 
505 

CCG TGG AAC 
Pro Trp Asn 
520 

CTG GAT AAA 
Leu Asp Lys 
535 

CAG GCT CTG 
Gin Ala Leu 



GAG CCC ATT 
Glu Pro He 



TTA CCT AAG 
Leu Pro Lys 
585 

TTC CAG CCA 
Phe Gin Pro 
600 



GGA TCT CCA 
Gly Ser Pro 
490 

AGA AAC TAT 
Arg Asn Tyr 



AGG TTG TTC 
Arg Leu Phe 



ATG TTG ACA 
Met Leu Thr 
540 

GCC CAC CCG 
Ala His Pro 
555 

GCT GAA GCA 
Ala Glu Ala 
570 

GAG AAG CTC 
Glu Lys Leu 

GGA TAC AGA 
Gly Tyr Arg 



TCA CAG GAA 
Ser Gin Glu 
495 

TTG CTT TCT 
Leu Leu Ser 
510 

CCA AAC GCT 
Pro Asn Ala 
525 

TTT AAC CCT 
Phe Asn Pro 



TAC CTG GAG 
Tyr Leu Glu 



CCA TTC AAG 
Pro Phe Lys 
575 

AAA GAA CTC 
Lys Glu Leu 
590 

TCT TAA 

Ser 

605 



GAT 1488 
Asp 

CTC 1536 
Leu 



GAC 1584 
Asp 



CAC 1632 
His 



CAG 1680 

Gin 

560 

TTT 1728 
Phe 



ATT 1776 
He 



1818 



(2) INFORMATION FOR SEQ ID NO: 41: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 605 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 



Met Val Ser Lys 
1 

Val Glu Leu Asp 

20 

Glu Gly Glu Gly 
35 

Cys Thr Thr Gly 
50 

Leu Thr Tyr Gly 
65 

Gin His Asp Phe 



Gly Glu Glu Leu 
5 

Gly Asp Val Asn 

Asp Ala Thr Tyr 
40 

Lys Leu Pro Val 
55 

Val Gin Cys Phe 
70 

Phe Lys Ser Ala 

85 



Phe Thr Gly Val 
10 

Gly His Lys Phe 
25 

Gly Lys Leu Thr 

Pro Trp Pro Thr 
60 

Ser Arg Tyr Pro 
75 

Met Pro Glu Gly 
90 



Val Pro He Leu 
15 

Ser Val Ser Gly 
30 

Leu Lys Phe He 
45 

Leu Val Thr Thr 

Asp His Met Lys 
80 

Tyr Val Gin Glu 
95 



Arg Thr lie Phe 
100 

Val Lys Phe Glu 
115 

lie Asp Phe Lys 
130 

Asn Tyr Asn Ser 
145 

Gly He Lys Val 

Val Gin Leu Ala 
180 

Pro Val Leu Leu 
195 

Ser Lys Asp Pro 
210 

Val Thr Ala Ala 
225 

Gly Leu Arg Ser 

Glu Met Val Arg 
260 

Leu Ser Tyr He 
275 

Asp Asn Leu Asn 
290 

Glu His Gin Thr 
305 

Leu Arg Phe Arg 

Ala Pro Thr He 
340 

Met Glu Thr Asp 
355 

Asp His He Cys 
370 

lie His Ser Ala 
385 

Leu Leu Asn Thr 

Arg Val Ala Asp 
420 

Val Ala Thr Arg 
435 

Gly Tyr Thr Lys 
450 

Glu Met Leu Ser 
465 

Gin Leu Asn His 

Leu Asn Cys He 

500 

Pro His Lys Asn 
515 

Ser Lys Ala Leu 
530 

Lys Arg He Glu 
545 



Phe Lys Asp Asp 

Gly Asp Thr Leu 
120 

Glu Asp Gly Asn 
135 

His Asn Val Tyr 
150 

Asn Phe Lys He 
165 

Asp His Tyr Gin 

Pro Asp Asn His 
200 

Asn Glu Lys Arg 
215 

Gly He Thr Leu 
230 

Arg Val Thr Met 
245 

Gly Gin Val Phe 

Gly Glu Gly Ala 
280 

Lys Val Arg Val 
295 

Tyr Cys Gin Arg 
310 

His Glu Asn He 
325 

Glu Gin Met Lys 

Leu Tyr Lys Leu 
360 

Tyr Phe Leu Tyr 
375 

Asn Val Leu His 
390 

Thr Cys Asp Leu 
405 

Pro Asp His Asp 

Trp Tyr Arg Ala 
440 

Ser He Asp He 
455 

Asn Arg Pro lie 
470 

He Leu Gly He 
485 

lie Asn Leu Lys 

Lys Val Pro Trp 
520 

Asp Leu Leu Asp 
535 

Val Glu Gin Ala 
550 
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Gly Asn Tyr Lys 
105 

Val Asn Arg lie 

lie Leu Gly His 
140 

He Met Ala Asp 
155 

Arg His Asn lie 
170 

Gin Asn Thr Pro 
185 

Tyr Leu Ser Thr 

Asp His Met Val 
220 

Gly Met Asp Glu 
235 

Ala Ala Ala Ala 
250 

Asp Val Gly Pro 
265 

Tyr Gly Met Val 

Ala He Lys Lys 

300 

Thr Leu Arg Glu 
315 

He Gly lie Asn 
330 

Asp Val Tyr He 
345 

Leu Lys Thr Gin 

Gin lie Leu Arg 
380 

Arg Asp Leu Lys 
395 

Lys He Cys Asp 
410 

His Thr Gly Phe 
425 

Pro Glu lie Met 

Trp Ser Val Gly 
460 

Phe Pro Gly Lys 
475 

Leu Gly Ser Pro 
490 

Ala Arg Asn Tyr 
505 

Asn Arg Leu Phe 

Lys Met Leu Thr 
540 

Leu Ala His Pro 
555 



Thr Arg Ala Glu 
110 

Glu Leu Lys Gly 
125 

Lys Leu Glu Tyr 

Lys Gin Lys Asn 
160 

Glu Asp Gly Ser 
175 

He Gly Asp Gly 
190 

Gin Ser Ala Leu 
205 

Leu Leu Glu Phe 

Leu Tyr Lys Ser 
240 

Ala Ala Gly Pro 
255 

Arg Tyr Thr Asn 
270 

Cys Ser Ala Tyr 
285 

lie Ser Pro Phe 

lie Lys lie Leu 
320 

Asp He lie Arg 
335 

Val Gin Asp Leu 
350 

His Leu Ser Asn 
365 

Gly Leu Lys Tyr 

Pro Ser Asn Leu 
400 

Phe Gly Leu Ala 
415 

Leu Thr Glu Tyr 
430 

Leu Asn Ser Lys 
445 

Cys He Leu Ala 

His Tyr Leu Asp 
480 

Ser Gin Glu Asp 
495 

Leu Leu Ser Leu 
510 

Pro Asn Ala Asp 
525 

Phe Asn Pro His 

Tyr Leu Glu Gin 
560 
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Tyr Tyr Asp Pro Ser Asp Glu Pro He Ala Glu Ala Pro Phe Lys Phe 

565 570 575 

Asp Met Glu Leu Asp Asp Leu Pro Lys Glu Lys Leu Lys Glu Leu He 

580 585 590 

Phe Glu Glu Thr Ala Arg Phe Gin Pro Gly Tyr Arg Ser 
595 600 605 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2529 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE : 

(A) NAME/KEY: Coding Sequence 
<B) LOCATION: 1. . .2526 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 48 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 240 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 2 88 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 336 
Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 HO 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 4 32 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 



130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 480 
Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 52 8 

Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 
165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 57 6 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 
180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 720 
Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT CGA GCT CAA GCT TCG AAT TCG TCA ATG GAG CTG GAA 768 
Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Ser Met Glu Leu Glu 
245 250 255 

AAC ATC GTG GCC AAC ACG GTC TTG CTG AAA GCC AGG GAA GGG GGC GGA 816 
Asn lie Val Ala Asn Thr Val Leu Leu Lys Ala Arg Glu Gly Gly Gly 
260 265 270 

GGA AAG CGC AAA GGG AAA AGC AAG AAG TGG AAA GAA ATC CTG AAG TTC 864 
Gly Lys Arg Lys Gly Lys Ser Lys Lys Trp Lys Glu lie Leu Lys Phe 
275 280 285 

CCT CAC ATT AGC CAG TGT GAA GAC CTC CGA AGG ACC ATA GAC AGA GAT 912 
Pro His lie Ser Gin Cys Glu Asp Leu Arg Arg Thr He Asp Arg Asp 
290 295 300 

TAC TGC AGT TTA TGT GAC AAG CAG CCA ATC GGG AGG CTG CTT TTC CGG 960 
Tyr Cys Ser Leu Cys Asp Lys Gin Pro He Gly Arg Leu Leu Phe Arg 
305 310 315 320 

CAG TTT TGT GAA ACC AGG CCT GGG CTG GAG TGT TAC ATT CAG TTC CTG 1008 
Gin Phe Cys Glu Thr Arg Pro Gly Leu Glu Cys Tyr He Gin Phe Leu 
325 330 335 

GAC TCC GTG GCA GAA TAT GAA GTT ACT CCA GAT GAA AAA CTG GGA GAG 1056 
Asp Ser Val Ala Glu Tyr Glu Val Thr Pro Asp Glu Lys Leu Gly Glu 
340 345 350 

AAA GGG AAG GAA ATT ATG ACC AAG TAC CTC ACC CCA AAG TCC CCT GTT 1104 
Lys Gly Lys Glu He Met Thr Lys Tyr Leu Thr Pro Lys Ser Pro Val 
355 360 365 
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TTC ATA GCC 
Phe He Ala 
370 

CTC CTA CAG 
Leu Leu Gin 
385 

GTC CAC GAG 
Val His Glu 



ATG TTT TTT 
Met Phe Phe 



GTG ACC AAA 
Val Thr Lys 
435 

TTC GGG GAG 
Phe Gly Glu 
450 

GCC TGC AAG 
Ala Cys Lys 
465 

TCC ATG GCC 
Ser Met Ala 



TTT GTG GTC 
Phe Val Val 



TTG GTC CTG 
Leu Val Leu 
515 

AAC ATG GGC 
Asn Met Gly 
530 

GCA GAG ATC 
Ala Glu He 
545 

TAC CGA GAT 
Tyr Arg Asp 



ATT AGG ATC 
He Arg He 



CTG ATC CGC 
Leu He Arg 



CAA GTT GGC 
Gin Val Gly 



AAG CCG TGC 
Lys Pro Cys 
390 

TAC CTG AGG 
Tyr Leu Arg 
405 

GAC CGC TTT 
Asp Arg Phe 
420 

AAC ACT TTC 
Asn Thr Phe 



GTC TGT GCC 
Val Cys Ala 



CGC TTG GAG 
Arg Leu Glu 
470 

CTC AAT GAG 
Leu Asn Glu 
485 

AAC CTG GCC 
Asn Leu Ala 
500 

ACC ATC ATG 
Thr He Met 



AAC CCT GGC 
Asn Pro Gly 



CTC TGC GGC 
Leu Cys Gly 
550 

CTG AAA CCT 
Leu Lys Pro 
565 

TCA GAC CTG 
Ser Asp Leu 
580 

GGC CGG GTG 
Gly Arg Val 



CAA GAC CTG 
Gin Asp Leu 
375 

AAA GAA CTC 
Lys Glu Leu 

GGA GAA CCA 
Gly Glu Pro 

CTC CAG TGG 
Leu Gin Trp 
425 

AGG CAG TAT 
Arg Gin Tyr 
440 

TGC CAG GTT 
Cys Gin Val 
455 

AAG AAG AGG 
Lys Lys Arg 



AAG CAG ATC 
Lys Gin lie 



TAT GCC TAC 
Tyr Ala Tyr 
505 

AAT GGG GGT 
Asn Gly Gly 
520 

TTC GAG GAG 
Phe Glu Glu 
535 

TTA GAA GAC 
Leu Glu Asp 



GAA AAC ATC 
Glu Asn lie 



GGC TTG GCT 
Gly Leu Ala 
585 

GGC ACT GTT 
Gly Thr Val 



GTC TCC CAG 
Val Ser Gin 
380 

TTT TCT GCC 
Phe Ser Ala 
395 

TTC CAC GAA 
Phe His Glu 
410 

AAG TGG TTG 
Lys Trp Leu 



CGA GTG CTA 
Arg Val Leu 



CGG GCC ACG 
Arg Ala Thr 
460 

ATC AAA AAG 
He Lys Lys 
475 

CTC GAG AAG 
Leu Glu Lys 
490 

GAG ACC AAG 
Glu Thr Lys 

GAC CTG AAG 
Asp Leu Lys 



GAG CGG GCC 
Glu Arg Ala 
540 

CTC CAC CGT 
Leu His Arg 
555 

CTG TTA GAT 
Leu Leu Asp 
570 

GTG AAG ATC 
Val Lys lie 



GGC TAC ATG 
Gly Tyr Met 



ACG GAG GAG 
Thr Glu Glu 



TGT GCA CAG 
Cys Ala Gin 



TAT CTG GAC 
Tyr Leu Asp 
415 

GAA AGG CAA 
Glu Arg Gin 
430 

GGA AAA GGG 
Gly Lys Gly 
445 

GGT AAA ATG 
Gly Lys Met 



AGG AAA GGG 
Arg Lys Gly 



GTC AAC AGT 
Val Asn Ser 
495 

GAT GCA CTG 
Asp Ala Leu 
510 

TTC CAC ATC 
Phe His He 
525 

TTG TTT TAT 
Leu Phe Tyr 

GAG AAC ACC 
Glu Asn Thr 



GAT TAT GGC 
Asp Tyr Gly 
575 

CCC GAG GGA 
Pro Glu Gly 
590 

GCC CCC GAA 
Ala Pro Glu 



AAG 1152 
Lys 



TCT 12 00 

Ser 

400 

AGC 1248 
Ser 



CCG 12 96 
Pro 



GGC 1344 
Gly 



TAT 1392 
Tyr 



GAG 1440 

Glu 

480 

CAG 14 88 
Gin 



TGC 1536 
Cys 



TAC 1584 
Tyr 



GCG 1632 
Ala 



GTC 1680 

val 

560 

CAC 1728 
His 



GAC 1776 
Asp 

GTC 1824 
Val 



'J V 



595 600 605 

CTG AAC AAC CAG AGG TAC GGC CTG AGC CCC GAC TAC TGG GGC CTT GGC 1872 
Leu Asn Asn Gin Arg Tyr Gly Leu Ser Pro Asp Tyr Trp Gly Leu Gly 
610 615 620 

TGC CTC ATC TAT GAG ATG ATC GAG GGC CAG TCG CCG TTC CGC GGC CGT 1920 
Cys Leu He Tyr Glu Met lie Glu Gly Gin Ser Pro Phe Arg Gly Arg 
625 630 635 640 

AAG GAG AAG GTG AAG CGG GAG GAG GTG GAC CGC CGG GTC CTG GAG ACG 1968 
Lys Glu Lys Val Lys Arg Glu Glu Val Asp Arg Arg Val Leu Glu Thr 
645 650 655 

GAG GAG GTG TAC TCC CAC AAG TTC TCC GAG GAG GCC AAG TCC ATC TGC 2016 
Glu Glu Val Tyr Ser His Lys Phe Ser Glu Glu Ala Lys Ser lie Cys 
660 665 670 

AAG ATG CTG CTC ACG AAA GAT GCG AAG CAG AGG CTG GGC TGC CAG GAG 2064 
Lys Met Leu Leu Thr Lys Asp Ala Lys Gin Arg Leu Gly Cys Gin Glu 
675 680 685 

GAG GGG GCT GCA GAG GTC AAG AGA CAC CCC TTC TTC AGG AAC ATG AAC 2112 
Glu Gly Ala Ala Glu Val Lys Arg His Pro Phe Phe Arg Asn Met Asn 
690 695 700 

TTC AAG CGC TTA GAA GCC GGG ATG TTG GAC CCT CCC TTC GTT CCA GAC 2160 
Phe Lys Arg Leu Glu Ala Gly Met Leu Asp Pro Pro Phe Val Pro Asp 
705 710 715 720 

CCC CGC GCT GTG TAC TGT AAG GAC GTG CTG GAC ATC GAG CAG TTC TCC 2208 
Pro Arg Ala Val Tyr Cys Lys Asp Val Leu Asp He Glu Gin Phe Ser 
725 730 735 

ACT GTG AAG GGC GTC AAT CTG GAC CAC ACA GAC GAC GAC TTC TAC TCC 22 56 
Thr Val Lys Gly Val Asn Leu Asp His Thr Asp Asp Asp Phe Tyr Ser 
740 745 750 

AAG TTC TCC ACG GGC TCT GTG TCC ATC CCA TGG CAA AAC GAG ATG ATA 2304 
Lys Phe Ser Thr Gly Ser Val Ser He Pro Trp Gin Asn Glu Met He 
755 760 765 

GAA ACA GAA TGC TTT AAG GAG CTG AAC GTG TTT GGA CCT AAT GGT ACC 2352 
Glu Thr Glu Cys Phe Lys Glu Leu Asn Val Phe Gly Pro Asn Gly Thr 
770 775 780 

CTC CCG CCA GAT CTG AAC AGA AAC CAC CCT CCG GAA CCG CCC AAG AAA 2400 
Leu Pro Pro Asp Leu Asn Arg Asn His Pro Pro Glu Pro Pro Lys Lys 
785 790 795 800 

GGG CTG CTC CAG AGA CTC TTC AAG CGG CAG CAT CAG AAC AAT TCC AAG 2448 
Gly Leu Leu Gin Arg Leu Phe Lys Arg Gin His Gin Asn Asn Ser Lys 
805 810 815 

AGT TCG CCC AGC TCC AAG ACC AGT TTT AAC CAC CAC ATA AAC TCA AAC 2496 
Ser Ser Pro Ser Ser Lys Thr Ser Phe Asn His His He Asn Ser Asn 
820 825 830 
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CAT GTC AGC TCG AAC TCC ACC GGA AGC AGC TAG 2 52 9 

His Val Ser Ser Asn Ser Thr Gly Ser Ser 
835 840 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 842 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
<v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 

15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 HO 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 

115 120 125 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 

130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Ser Met Glu Leu Glu 

245 250 255 

Asn He Val Ala Asn Thr Val Leu Leu Lys Ala Arg Glu Gly Gly Gly 

260 265 270 

Gly Lys Arg Lys Gly Lys Ser Lys Lys Trp Lys Glu He Leu Lys Phe 

275 280 285 

Pro His He Ser Gin Cys Glu Asp Leu Arg Arg Thr He Asp Arg Asp 

290 295 300 

Tyr Cys Ser Leu Cys Asp Lys Gin Pro He Gly Arg Leu Leu Phe Arg 
305 310 315 320 



Gin Phe Cys Glu Thr Arg Pro Gly Leu Glu Cys Tyr lie Gin Phe Leu 

325 330 335 

Asp Ser Val Ala Glu Tyr Glu Val Thr Pro Asp Glu Lys Leu Gly Glu 

340 345 350 

Lys Gly Lys Glu lie Met Thr Lys Tyr Leu Thr Pro Lys Ser Pro Val 

355 360 365 

Phe He Ala Gin Val Gly Gin Asp Leu Val Ser Gin Thr Glu Glu Lys 

370 375 380 

Leu Leu Gin Lys Pro Cys Lys Glu Leu Phe Ser Ala Cys Ala Gin Ser 
385 390 395 400 

Val His Glu Tyr Leu Arg Gly Glu Pro Phe His Glu Tyr Leu Asp Ser 

405 410 415 

Met Phe Phe Asp Arg Phe Leu Gin Trp Lys Trp Leu Glu Arg Gin Pro 

420 425 430 

Val Thr Lys Asn Thr Phe Arg Gin Tyr Arg Val Leu Gly Lys Gly Gly 

435 440 445 

Phe Gly Glu Val Cys Ala Cys Gin Val Arg Ala Thr Gly Lys Met Tyr 

450 455 460 

Ala Cys Lys Arg Leu Glu Lys Lys Arg He Lys Lys Arg Lys Gly Glu 
465 470 475 480 

Ser Met Ala Leu Asn Glu Lys Gin He Leu Glu Lys Val Asn Ser Gin 

485 490 495 

Phe Val Val Asn Leu Ala Tyr Ala Tyr Glu Thr Lys Asp Ala Leu Cys 

500 505 510 

Leu Val Leu Thr He Met Asn Gly Gly Asp Leu Lys Phe His He Tyr 

515 520 525 

Asn Met Gly Asn Pro Gly Phe Glu Glu Glu Arg Ala Leu Phe Tyr Ala 

530 535 540 

Ala Glu He Leu Cys Gly Leu Glu Asp Leu His Arg Glu Asn Thr Val 
545 550 555 560 

Tyr Arg Asp Leu Lys Pro Glu Asn He Leu Leu Asp Asp Tyr Gly His 

565 570 575 

lie Arg He Ser Asp Leu Gly Leu Ala Val Lys He Pro Glu Gly Asp 

580 585 590 

Leu He Arg Gly Arg Val Gly Thr Val Gly Tyr Met Ala Pro Glu Val 

595 600 605 

Leu Asn Asn Gin Arg Tyr Gly Leu Ser Pro Asp Tyr Trp Gly Leu Gly 

610 615 620 

Cys Leu He Tyr Glu Met lie Glu Gly Gin Ser Pro Phe Arg Gly Arg 
625 630 635 640 

Lys Glu Lys Val Lys Arg Glu Glu Val Asp Arg Arg Val Leu Glu Thr 

645 650 655 

Glu Glu Val Tyr Ser His Lys Phe Ser Glu Glu Ala Lys Ser He Cys 

660 665 670 

Lys Met Leu Leu Thr Lys Asp Ala Lys Gin Arg Leu Gly Cys Gin Glu 

675 680 685 

Glu Gly Ala Ala Glu Val Lys Arg His Pro Phe Phe Arg Asn Met Asn 

690 695 700 

Phe Lys Arg Leu Glu Ala Gly Met Leu Asp Pro Pro Phe Val Pro Asp 
705 710 715 720 

Pro Arg Ala Val Tyr Cys Lys Asp Val Leu Asp He Glu Gin Phe Ser 

725 730 735 

Thr Val Lys Gly Val Asn Leu Asp His Thr Asp Asp Asp Phe Tyr Ser 

740 745 750 

Lys Phe Ser Thr Gly Ser Val Ser He Pro Trp Gin Asn Glu Met lie 

755 760 765 

Glu Thr Glu Cys Phe Lys Glu Leu Asn Val Phe Gly Pro Asn Gly Thr 
770 775 780 



Leu Pro Pro Asp Leu Asn Arg Asn His Pro Pro Glu Pro Pro Lys Lys 
785 790 795 800 

Gly Leu Leu Gin Arg Leu Phe Lys Arg Gin His Gin Asn Asn Ser Lys 

805 810 815 

Ser Ser Pro Ser Ser Lys Thr Ser Phe Asn His His lie Asn Ser Asn 

820 825 830 

His Val Ser Ser Asn Ser Thr Gly Ser Ser 
835 840 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1902 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/ KEY: Coding Sequence 

(B) LOCATION: 1...1899 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 48 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 240 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 288 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 336 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 



GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 
Val Lys Fhe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 



384 



ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 432 
lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 4 80 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 720 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT CGA GCT CGA GCC ATC ATG AGC AGA AGC AAG CGT GAC 7 68 

Gly Leu Arg Ser Arg Ala Arg Ala He Met Ser Arg Ser Lys Arg Asp 
245 250 255 

AAC AAT TTT TAT AGT GTA GAG ATT GGA GAT TCT ACA TTC ACA GTC CTG 816 
Asn Asn Phe Tyr Ser Val Glu He Gly Asp Ser Thr Phe Thr Val Leu 
260 265 270 

AAA CGA TAT CAG AAT TTA AAA CCT ATA GGC TCA GGA GCT CAA GGA ATA 864 
Lys Arg Tyr Gin Asn Leu Lys Pro He Gly Ser Gly Ala Gin Gly He 
275 280 285 

GTA TGC GCA GCT TAT GAT GCC ATT CTT GAA AGA AAT GTT GCA ATC AAG 912 
Val Cys Ala Ala Tyr Asp Ala lie Leu Glu Arg Asn Val Ala He Lys 
290 295 300 

AAG CTA AGC CGA CCA TTT CAG AAT CAG ACT CAT GCC AAG CGG GCC TAC 960 
Lys Leu Ser Arg Pro Phe Gin Asn Gin Thr His Ala Lys Arg Ala Tyr 
305 310 315 320 

AGA GAG CTA GTT CTT ATG AAA TGT GTT AAT CAC AAA AAT ATA ATT GGC 1008 
Arg Glu Leu Val Leu Met Lys Cys Val Asn His Lys Asn He He Gly 
325 330 335 

CTT TTG AAT GTT TTC ACA CCA CAG AAA TCC CTA GAA GAA TTT CAA GAT 1056 
Leu Leu Asn Val Phe Thr Pro Gin Lys Ser Leu Glu Glu Phe Gin Asp 
340 345 350 

GTT TAC ATA GTC ATG GAG CTC ATG GAT GCA AAT CTT TGC CAA GTG ATT 1104 
Val Tyr He Val Met Glu Leu Met Asp Ala Asn Leu Cys Gin Val He 
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355 360 365 

CAG ATG GAG CTA GAT CAT GAA AGA ATG TCC TAC CTT CTC TAT CAG ATG 1152 
Gin Met Glu Leu Asp His Glu Arg Met Ser Tyr Leu Leu Tyr Gin Met 
370 375 380 

CTG TGT GGA ATC AAG CAC CTT CAT TCT GCT GGA ATT ATT CAT CGG GAC 12 00 

Leu Cys Gly He Lys His Leu His Ser Ala Gly He He His Arg Asp 
385 390 395 400 

TTA AAG CCC AGT AAT ATA GTA GTA AAA TCT GAT TGC ACT TTG AAG ATT 12 48 

Leu Lys Pro Ser Asn He Val Val Lys Ser Asp Cys Thr Leu Lys He 
405 410 415 

CTT GAC TTC GGT CTG GCC AGG ACT GCA GGA ACG AGT TTT ATG ATG ACG 12 96 

Leu Asp Phe Gly Leu Ala Arg Thr Ala Gly Thr Ser Phe Met Met Thr 
420 425 430 

CCT TAT GTA GTG ACT CGC TAC TAC AGA GCA CCC GAG GTC ATC CTT GGC 1344 
Pro Tyr Val Val Thr Arg Tyr Tyr Arg Ala Pro Glu Val He Leu Gly 
435 440 445 

ATG GGC TAC AAG GAA AAC GTG GAT TTA TGG TCT GTG GGG TGC ATT ATG 13 92 

Met Gly Tyr Lys Glu Asn Val Asp Leu Trp Ser Val Gly Cys He Met 
450 455 460 

GGA GAA ATG GTT TGC CAC AAA ATC CTC TTT CCA GGA AGG GAC TAT ATT 1440 
Gly Glu Met Val Cys His Lys He Leu Phe Pro Gly Arg Asp Tyr He 
465 470 475 480 

GAT CAG TGG AAT AAA GTT ATT GAA CAG CTT GGA AC A CCA TGT CCT GAA 14 88 
Asp Gin Trp Asn Lys Val lie Glu Gin Leu Gly Thr Pro Cys Pro Glu 
485 490 495 

TTC ATG AAG AAA CTG CAA CCA AC A GTA AGG ACT TAC GTT GAA AAC AGA 1536 
Phe Met Lys Lys Leu Gin Pro Thr Val Arg Thr Tyr Val Glu Asn Arg 
500 505 510 

CCT AAA TAT GCT GGA TAT AGC TTT GAG AAA CTC TTC CCT GAT GTC CTT 1584 
Pro Lys Tyr Ala Gly Tyr Ser Phe Glu Lys Leu Phe Pro Asp Val Leu 
515 520 525 

TTC CCA GCT GAC TCA GAA CAC AAC AAA CTT AAA GCC AGT CAG GCA AGG 1632 
Phe Pro Ala Asp Ser Glu His Asn Lys Leu Lys Ala Ser Gin Ala Arg 
530 535 540 

GAT TTG TTA TCC AAA ATG CTG GTA ATA GAT GCA TCT AAA AGG ATC TCT 1680 
Asp Leu Leu Ser Lys Met Leu Val He Asp Ala Ser Lys Arg He Ser 
545 550 555 560 

GTA GAT GAA GCT CTC CAA CAC CCG TAC ATC AAT GTC TGG TAT GAT CCT 1728 
Val Asp Glu Ala Leu Gin His Pro Tyr He Asn Val Trp Tyr Asp Pro 
565 570 575 

TCT GAA GCA GAA GCT CCA CCA CCA AAG ATC CCT GAC AAG CAG TTA GAT 1776 
Ser Glu Ala Glu Ala Pro Pro Pro Lys He Pro Asp Lys Gin Leu Asp 
580 585 590 



GAA AGG GAA CAC ACA ATA GAA GAG TGG AAA GAA TTG ATA TAT AAG GAA 1824 
Glu Arg Glu His Thr lie Glu Glu Trp Lys Glu Leu lie Tyr Lys Glu 
595 600 605 

GTT ATG GAC TTG GAG GAG AGA ACC AAG AAT GGA GTT ATA CGG GGG CAG 1872 
Val Met Asp Leu Glu Glu Arg Thr Lys Asn Gly Val lie Arg Gly Gin 
610 615 620 

CCC TCT CCT TTA GCA CAG GTG CAG CAG TGA 1902 
Pro Ser Pro Leu Ala Gin Val Gin Gin 
625 630 



(2) INFORMATION FOR SEQ ID NO: 45: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 633 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE : internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Arg Ala Arg Ala He Met Ser Arg Ser Lys Arg Asp 
245 250 255 
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Asn Asn Fhe Tyr 
260 

Lys Arg Tyr Gin 

275 

Val Cys Ala Ala 
290 

Lys Leu Ser Arg 

30S 

Arg Glu Leu Val 

Leu Leu Asn Val 
340 

Val Tyr lie Val 
355 

Gin Met Glu Leu 
370 

Leu Cys Gly lie 
385 

Leu Lys Pro Ser 

Leu Asp Phe Gly 
420 

Pro Tyr Val Val 
435 

Met Gly Tyr Lys 
450 

Gly Glu Met Val 
465 

Asp Gin Trp Asn 

Phe Met Lys Lys 
500 

Pro Lys Tyr Ala 
515 

Phe Pro Ala Asp 
530 

Asp Leu Leu Ser 
545 

Val Asp Glu Ala 

Ser Glu Ala Glu 
580 

Glu Arg Glu His 
595 

Val Met Asp Leu 
610 

Pro Ser Pro Leu 
625 



Ser Val Glu lie 

Asn Leu Lys Pro 
280 

Tyr Asp Ala lie 
295 

Pro Phe Gin Asn 
310 

Leu Met Lys Cys 
325 

Phe Thr Pro Gin 

Met Glu Leu Met 
360 

Asp His Glu Arg 
375 

Lys His Leu His 
390 

Asn He Val Val 
405 

Leu Ala Arg Thr 

Thr Arg Tyr Tyr 
440 

Glu Asn Val Asp 
455 

Cys His Lys He 
470 

Lys Val He Glu 
485 

Leu Gin Pro Thr 

Gly Tyr Ser Phe 
520 

Ser Glu His Asn 
535 

Lys Met Leu Val 
550 

Leu Gin His Pro 
565 

Ala Pro Pro Pro 

Thr He Glu Glu 
600 

Glu Glu Arg Thr 
615 

Ala Gin Val Gin 
630 



Gly Asp Ser Thr 
265 

He Gly Ser Gly 

Leu Glu Arg Asn 
300 

Gin Thr His Ala 
315 

Val Asn His Lys 
330 

Lys Ser Leu Glu 
345 

Asp Ala Asn Leu 

Met Ser Tyr Leu 
380 

Ser Ala Gly He 
395 

Lys Ser Asp Cys 
410 

Ala Gly Thr Ser 
425 

Arg Ala Pro Glu 

Leu Trp Ser Val 
460 

Leu Phe Pro Gly 
475 

Gin Leu Gly Thr 
490 

Val Arg Thr Tyr 
505 

Glu Lys Leu Phe 

Lys Leu Lys Ala 
540 

He Asp Ala Ser 
555 

Tyr He Asn Val 
570 

Lys He Pro Asp 
585 

Trp Lys Glu Leu 

Lys Asn Gly Val 
620 

Gin 



Phe Thr Val Leu 
270 

Ala Gin Gly He 
285 

Val Ala He Lys 

Lys Arg Ala Tyr 

320 

Asn He He Gly 
335 

Glu Phe Gin Asp 
350 

Cys Gin Val He 
365 

Leu Tyr Gin Met 

He His Arg Asp 
400 

Thr Leu Lys He 
415 

Phe Met Met Thr 
430 

Val He Leu Gly 
445 

Gly Cys He Met 

Arg Asp Tyr He 
480 

Pro Cys Pro Glu 
495 

Val Glu Asn Arg 
510 

Pro Asp Val Leu 
525 

Ser Gin Ala Arg 

Lys Arg He Ser 
560 

Trp Tyr Asp Pro 
575 

Lys Gin Leu Asp 
590 

He Tyr Lys Glu 
605 

lie Arg Gly Gin 



(2) I NFORMAT I ON FOR SEQ ID NO: 46: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1824 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: cDNA 



(ix) FEATURE : 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION : 1. . .1821 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 48 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
Cys Thx Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 240 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 2 88 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 336 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 432 
lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 480 
Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 528 
Gly lie Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 57 6 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 



CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 



J / 



195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 7 20 

Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT CGA GGG AAA ATG TCT CAG GAG AGG CCC ACG TTC TAC 7 68 

Gly Leu Arg Ser Arg Gly Lys Met Ser Gin Glu Arg Pro Thr Phe Tyr 
245 250 255 

CGG CAG GAG CTG AAC AAG ACA ATC TGG GAG GTG CCC GAG CGT TAC CAG 816 
Arg Gin Glu Leu Asn Lys Thr lie Trp Glu Val Pro Glu Arg Tyr Gin 
260 265 270 

AAC CTG TCT CCA GTG GGC TCT GGC GCC TAT GGC TCT GTG TGT GCT GCT 864 
Asn Leu Ser Pro Val Gly Ser Gly Ala Tyr Gly Ser Val Cys Ala Ala 
275 280 285 

TTT GAC ACA AAA ACG GGG TTA CGT GTG GCA GTG AAG AAG CTC TCC AGA 912 
Phe Asp Thr Lys Thr Gly Leu Arg Val Ala Val Lys Lys Leu Ser Arg 
290 295 300 

CCA TTT CAG TCC ATC ATT CAT GCG AAA AGA ACC TAC AGA GAA CTG CGG 960 
Pro Phe Gin Ser lie lie His Ala Lys Arg Thr Tyr Arg Glu Leu Arg 
305 310 315 320 

TTA CTT AAA CAT ATG AAA CAT GAA AAT GTG ATT GGT CTG TTG GAC GTT 1008 
Leu Leu Lys His Met Lys His Glu Asn Val lie Gly Leu Leu Asp Val 
325 330 335 

TTT ACA CCT GCA AGG TCT CTG GAG GAA TTC AAT GAT GTG TAT CTG GTG 1056 
Phe Thr Pro Ala Arg Ser Leu Glu Glu Phe Asn Asp Val Tyr Leu Val 
340 345 350 

ACC CAT CTC ATG GGG GCA GAT CTG AAC AAC ATT GTG AAA TGT CAG AAG 1104 
Thr His Leu Met Gly Ala Asp Leu Asn Asn lie Val Lys Cys Gin Lys 
355 360 365 

CTT ACA GAT GAC CAT GTT CAG TTC CTT ATC TAC CAA ATT CTC CGA GGT 1152 
Leu Thr Asp Asp His Val Gin Phe Leu lie Tyr Gin lie Leu Arg Gly 
370 375 380 

CTA AAG TAT ATA CAT TCA GCT GAC ATA ATT CAC AGG GAC CTA AAA CCT 12 00 
Leu Lys Tyr He His Ser Ala Asp He He His Arg Asp Leu Lys Pro 
385 390 395 400 

AGT AAT CTA GCT GTG AAT GAA GAC TGT GAG CTG AAG ATT CTG GAT TTT 1248 
Ser Asn Leu Ala Val Asn Glu Asp Cys Glu Leu Lys He Leu Asp Phe 
405 410 415 

GGA CTG GCT CGG CAC ACA GAT GAT GAA ATG ACA GGC TAC GTG GCC ACT 12 96 
Gly Leu Ala Arg His Thr Asp Asp Glu Met Thr Gly Tyr Val Ala Thr 
420 425 430 
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AGG TGG TAC AGG GCT CCT GAG ATC ATG CTG AAC TGG ATG CAT TAC AAC 1344 

Arg Trp Tyr Arg Ala Pro Glu lie Met Leu Asn Trp Met His Tyr Asn 
435 440 445 

CAG ACA GTT GAT ATT TGG TCA GTG GGA TGC ATA ATG GCC GAG CTG TTG 13 92 

Gin Thr Val Asp lie Trp Ser Val Gly Cys lie Met Ala Glu Leu Leu 

450 455 460 

ACT GGA AGA ACA TTG TTT CCT GGT ACA GAC CAT ATT GAT CAG TTG AAG 1440 

Thr Gly Arg Thr Leu Phe Pro Gly Thr Asp His lie Asp Gin Leu Lys 
465 470 475 480 

CTC ATT TTA AGA CTC GTT GGA ACC CCA GGG GCT GAG CTT TTG AAG AAA 1488 

Leu He Leu Arg Leu Val Gly Thr Pro Gly Ala Glu Leu Leu Lys Lys 
485 490 495 

ATC TCC TCA GAG TCT GCA AGA AAC TAT ATT CAG TCT TTG ACT CAG ATG 1536 

He Ser Ser Glu Ser Ala Arg Asn Tyr He Gin Ser Leu Thr Gin Met 
500 505 510 



CCG AAG ATG AAC TTT GCG AAT GTA TTT ATT GGT GCC AAT CCC CTG GCT 1584 

Pro Lys Met Asn Phe Ala Asn Val Phe He Gly Ala Asn Pro Leu Ala 
515 520 525 

GTC GAC TTG CTG GAG AAG ATG CTT GTA TTG GAC TCA GAT AAG AGA ATT 1632 

Val Asp Leu Leu Glu Lys Met Leu Val Leu Asp Ser Asp Lys Arg He 
530 535 540 

ACA GCG GCC CAA GCC CTT GCA CAT GCC TAC TTT GCT CAG TAC CAC GAT 1680 

Thr Ala Ala Gin Ala Leu Ala His Ala Tyr Phe Ala Gin Tyr His Asp 
545 550 555 560 

CCT GAT GAT GAA CCA GTG GCC GAT CCT TAT GAT CAG TCC TTT GAA AGC 1728 

Pro Asp Asp Glu Pro Val Ala Asp Pro Tyr Asp Gin Ser Phe Glu Ser 

565 570 575 



AGG GAC CTC CTT ATA GAT GAG TGG AAA AGC CTG ACC TAT GAT GAA GTC 1776 
Arg Asp Leu Leu He Asp Glu Trp Lys Ser Leu Thr Tyr Asp Glu Val 
580 585 590 



ATC AGC TTT GTG CCA CCA CCC CTT GAC CAA GAA GAG ATG GAG TCC TGA 1824 
He Ser Phe Val Pro Pro Pro Leu Asp Gin Glu Glu Met Glu Ser 
595 600 605 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 607 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 



Met Val Ser Lys 
1 

Val Glu Leu Asp 
20 

Glu Gly Glu Gly 
35 

Cys Thr Thr Gly 
50 

Leu Thr Tyr Gly 
65 

Gin His Asp Phe 

Arg Thr lie Phe 
100 

Val Lys Phe Glu 
115 

lie Asp Phe Lys 
130 

Asn Tyr Asn Ser 
145 

Gly He Lys Val 

Val Gin Leu Ala 
180 

Pro Val Leu Leu 
195 

Ser Lys Asp Pro 
210 

Val Thr Ala Ala 
225 

Gly Leu Arg Ser 

Arg Gin Glu Leu 
260 

Asn Leu Ser Pro 
275 

Phe Asp Thr Lys 
290 

Pro Phe Gin Ser 
305 

Leu Leu Lys His 

Phe Thr Pro Ala 
340 

Thr His Leu Met 
355 

Leu Thr Asp Asp 
370 

Leu Lys Tyr He 
385 

Ser Asn Leu Ala 

Gly Leu Ala Arg 
420 

Arg Trp Tyr Arg 
435 

Gin Thr Val Asp 
450 



Gly Glu Glu Leu 
5 

Gly Asp Val Asn 

Asp Ala Thr Tyr 
40 

Lys Leu Pro Val 
55 

Val Gin Cys Phe 
70 

Phe Lys Ser Ala 
85 

Phe Lys Asp Asp 

Gly Asp Thr Leu 
120 

Glu Asp Gly Asn 
135 

His Asn Val Tyr 
150 

Asn Phe Lys He 
165 

Asp His Tyr Gin 

Pro Asp Asn His 
200 

Asn Glu Lys Arg 
215 

Gly He Thr Leu 
230 

Arg Gly Lys Met 
245 

Asn Lys Thr He 

Val Gly Ser Gly 
280 

Thr Gly Leu Arg 
295 

He He His Ala 
310 

Met Lys His Glu 
325 

Arg Ser Leu Glu 

Gly Ala Asp Leu 
360 

His Val Gin Phe 
375 

His Ser Ala Asp 
390 

Val Asn Glu Asp 
405 

His Thr Asp Asp 

Ala Pro Glu He 
440 

He Trp Ser Val 
455 



Phe Thr Gly Val 
10 

Gly His Lys Phe 
25 

Gly Lys Leu Thr 

Pro Trp Pro Thr 
60 

Ser Arg Tyr Pro 
75 

Met Pro Glu Gly 
90 

Gly Asn Tyr Lys 
105 

Val Asn Arg He 

He Leu Gly His 
140 

He Met Ala Asp 
155 

Arg His Asn He 
170 

Gin Asn Thr Pro 
185 

Tyr Leu Ser Thr 

Asp His Met Val 
220 

Gly Met Asp Glu 
235 

Ser Gin Glu Arg 
250 

Trp Glu Val Pro 
265 

Ala Tyr Gly Ser 

Val Ala Val Lys 
300 

Lys Arg Thr Tyr 
315 

Asn Val lie Gly 
330 

Glu Phe Asn Asp 
345 

Asn Asn He Val 

Leu He Tyr Gin 
380 

lie lie His Arg 
395 

Cys Glu Leu Lys 
410 

Glu Met Thr Gly 
425 

Met Leu Asn Trp 

Gly Cys lie Met 
460 



Val Pro He Leu 
15 

Ser Val Ser Gly 
30 

Leu Lys Phe He 
45 

Leu Val Thr Thr 

Asp His Met Lys 
80 

Tyr Val Gin Glu 
95 

Thr Arg Ala Glu 
110 

Glu Leu Lys Gly 
125 

Lys Leu Glu Tyr 

Lys Gin Lys Asn 
160 

Glu Asp Gly Ser 
175 

lie Gly Asp Gly 
190 

Gin Ser Ala Leu 
205 

Leu Leu Glu Phe 

Leu Tyr Lys Ser 
240 

Pro Thr Phe Tyr 
255 

Glu Arg Tyr Gin 
270 

Val Cys Ala Ala 
285 

Lys Leu Ser Arg 

Arg Glu Leu Arg 
320 

Leu Leu Asp Val 
335 

Val Tyr Leu Val 
350 

Lys Cys Gin Lys 

365 

He Leu Arg Gly 

Asp Leu Lys Pro 
400 

lie Leu Asp Phe 
415 

Tyr Val Ala Thr 

430 

Met His Tyr Asn 
445 

Ala Glu Leu Leu 



Thr Gly Arg Thr Leu Phe Pro Gly Thr Asp His He Asp Gin Leu Lys 
465 470 475 480 

Leu He Leu Arg Leu Val Gly Thr Pro Gly Ala Glu Leu Leu Lys Lys 

485 490 495 

He Ser Ser Glu Ser Ala Arg Asn Tyr He Gin Ser Leu Thr Gin Met 

500 505 510 

Pro Lys Met Asn Phe Ala Asn Val Phe He Gly Ala Asn Pro Leu Ala 

515 520 525 

Val Asp Leu Leu Glu Lys Met Leu Val Leu Asp Ser Asp Lys Arg He 

530 535 540 

Thr Ala Ala Gin Ala Leu Ala His Ala Tyr Phe Ala Gin Tyr His Asp 
545 550 555 560 

Pro Asp Asp Glu Pro Val Ala Asp Pro Tyr Asp Gin Ser Phe Glu Ser 

565 570 575 

Arg Asp Leu Leu He Asp Glu Trp Lys Ser Leu Thr Tyr Asp Glu Val 

580 585 590 

lie Ser Phe Val Pro Pro Pro Leu Asp Gin Glu Glu Met Glu Ser 
595 600 605 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2907 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE : 

(A) NAME /KEY: Coding Sequence 

(B) LOCATION: 1...2904 
<D) OTHER INFORMATION : 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

ATG GTG AGO AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 48 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 240 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 



CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 288 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 



85 



90 



95 



CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 3 36 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 432 
lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 4 80 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 528 
Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 
165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 
180 185 190 

CCC GTG CTG CTG CCC- GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 72 0 

Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT ATG AGT GCT GAG GGG TAC CAG TAC AGA GCG CTG TAT 7 68 

Gly Leu Arg Ser Met Ser Ala Glu Gly Tyr Gin Tyr Arg Ala Leu Tyr 
245 250 255 

GAT TAT AAA AAG GAA AGA GAA GAA GAT ATT GAC TTG CAC TTG GGT GAC 816 
Asp Tyr Lys Lys Glu Arg Glu Glu Asp He Asp Leu His Leu Gly Asp 
260 265 270 

ATA TTG ACT GTG AAT AAA GGG TCC TTA GTA GCT CTT GGA TTC AGT GAT 864 
He Leu Thr Val Asn Lys Gly Ser Leu Val Ala Leu Gly Phe Ser Asp 
275 280 285 

GGA CAG GAA GCC AGG CCT GAA GAA ATT GGC TGG TTA AAT GGC TAT AAT 912 
Gly Gin Glu Ala Arg Pro Glu Glu He Gly Trp Leu Asn Gly Tyr Asn 
290 295 300 

GAA ACC ACA GGG GAA AGG GGG GAC TTT CCG G3A ACT TAC GTA GAA TAT 9 60 

Glu Thr Thr Gly Glu Arg Gly Asp Phe Pro Gly Thr Tyr Val Glu Tyr 
305 310 315 320 



ATT GGA AGG AAA AAA ATC TCG CCT CCC ACA CCA AAG CCC CGG CCA CCT 1008 
lie Gly Arg Lys Lys He Ser Pro Pro Thr Pro Lys Pro Arg Pro Pro 
325 330 335 

CGG CCT CTT CCT GTT GCA CCA GGT TCT TCG AAA ACT GAA GGA GAT GTT 1056 
Arg Pro Leu Pro Val Ala Pro Gly Ser Ser Lys Thr Glu Ala Asp Val 
340 345 350 

GAA CAA CAA GCT TTG ACT CTC CCG GAT CTT GCA GAG CAG TTT GCC CCT 1104 
Glu Gin Gin Ala Leu Thr Leu Pro Asp Leu Ala Glu Gin Phe Ala Pro 
355 360 365 

CCT GAC ATT GCC CCG CCT CTT CTT ATC AAG CTC GTG GAA GCC ATT GAA 1152 
Pro Asp He Ala Pro Pro Leu Leu He Lys Leu Val Glu Ala He Glu 
370 375 380 

AAG AAA GGT CTG GAA TGT TCA ACT CTA TAC AGA ACA CAG AGC TCC AGC 1200 
Lys Lys Gly Leu Glu Cys Ser Thr Leu Tyr Arg Thr Gin Ser Ser Ser 
385 390 395 400 

AAC CTG GCA GAA TTA CGA CAG CTT CTT GAT TGT GAT ACA CCC TCC GTG 1248 
Asn Leu Ala Glu Leu Arg Gin Leu Leu Asp Cys Asp Thr Pro Ser Val 
405 410 415 

GAC TTG GAA ATG ATC GAT GTG CAC GTT TTG GCT GAC GCT TTC AAA CGC 12 96 
Asp Leu Glu Met He Asp Val His Val Leu Ala Asp Ala Phe Lys Arg 
420 425 430 

TAT CTC CTG GAC TTA CCA AAT CCT GTC ATT CCA GCA GCC GTT TAC AGT 1344 
Tyr Leu Leu Asp Leu Pro Asn Pro Val He Pro Ala Ala Val Tyr Ser 
435 440 445 

GAA ATG ATT TCT TTA GCT CCA GAA GTA CAA AGC TCC GAA GAA TAT ATT 13 92 

Glu Met He Ser Leu Ala Pro Glu Val Gin Ser Ser Glu Glu Tyr He 
450 455 460 

CAG CTA TTG AAG AAG CTT ATT AGG TCG CCT AGC ATA CCT CAT CAG TAT 1440 
Gin Leu Leu Lys Lys Leu He Arg Ser Pro Ser He Pro His Gin Tyr 
465 470 475 480 

TGG CTT ACG CTT CAG TAT TTG TTA AAA CAT TTC TTC AAG CTC TCT CAA 1488 
Trp Leu Thr Leu Gin Tyr Leu Leu Lys His Phe Phe Lys Leu Ser Gin 
485 490 495 

ACC TCC AGC AAA AAT CTG TTG AAT GCA AGA GTA CTC TCT GAA ATT TTC 1536 
Thr Ser Ser Lys Asn Leu Leu Asn Ala Arg Val Leu Ser Glu He Phe 
500 505 510 

AGC CCT ATG CTT TTC AGA TTC TCA GCA GCC AGC TCT GAT AAT ACT GAA 1584 
Ser Pro Met Leu Phe Arg Phe Ser Ala Ala Ser Ser Asp Asn Thr Glu 
515 520 525 

AAC CTC ATA AAA GTT ATA GAA ATT TTA ATC TCA ACT GAA TGG AAT GAA 1632 
Asn Leu He Lys Val He Glu He Leu He Ser Thr Glu Trp Asn Glu 
530 535 540 

CGA CAG CCT GCA CCA GCA CTG CCT CCT AAA CCA CCA AAA CCT ACT ACT 1680 
Arg Gin Pro Ala Pro Ala Leu Pro Pro Lys Pro Pro Lys Pro Thr Thr 



545 550 555 560 

GTA GCC AAC AAC GGT ATG AAT AAC AAT ATG TCC TTA CAA AAT GCT GAA 172 8 
Val Ala Asn Asn Gly Met Asn Asn Asn Met Ser Leu Gin Asn Ala Glu 
565 570 575 

TGG TAC TGG GGA GAT ATC TCG AGG GAA GAA GTG AAT GAA AAA CTT CGA 1776 
Trp Tyr Trp Gly Asp lie Ser Arg Glu Glu Val Asn Glu Lys Leu Arg 
580 585 590 

GAT ACA GCA GAC GGG ACC TTT TTG GTA CGA GAT GCG TCT ACT AAA ATG 1824 
Asp Thr Ala Asp Gly Thr Phe Leu Val Arg Asp Ala Ser Thr Lys Met 
595 600 605 

CAT GGT GAT TAT ACT CTT ACA CTA AGG AAA GGG GGA AAT AAC AAA TTA 1872 
His Gly Asp Tyr Thr Leu Thr Leu Arg Lys Gly Gly Asn Asn Lys Leu 
610 615 620 

ATC AAA ATA TTT CAT CGA GAT GGG AAA TAT GGC TTC TCT GAC CCA TTA 1920 
lie Lys lie Phe His Arg Asp Gly Lys Tyr Gly Phe Ser Asp Pro Leu 
625 630 635 640 

ACC TTC AGT TCT GTG GTT GAA TTA ATA AAC CAC TAC CGG AAT GAA TCT 1968 
Thr Phe Ser Ser Val Val Glu Leu lie Asn His Tyr Arg Asn Glu Ser 
645 650 655 

CTA GCT CAG TAT AAT CCC AAA TTG GAT GTG AAA TTA CTT TAT CCA GTA 2 016 
Leu Ala Gin Tyr Asn Pro Lys Leu Asp Val Lys Leu Leu Tyr Pro Val 
660 665 670 

TCC AAA TAC CAA CAG GAT CAA GTT GTC AAA GAA GAT AAT ATT GAA GCT 2064 
Ser Lys Tyr Gin Gin Asp Gin Val Val Lys Glu Asp Asn He Glu Ala 
675 680 685 

GTA GGG AAA AAA TTA CAT GAA TAT AAC ACT CAG TTT CAA GAA AAA AGT 2112 
Val Gly Lys Lys Leu His Glu Tyr Asn Thr Gin Phe Gin Glu Lys Ser 
690 695 700 

CGA GAA TAT GAT AGA TTA TAT GAA GAA TAT ACC CGC ACA TCC CAG GAA 2160 
Arg Glu Tyr Asp Arg Leu Tyr Glu Glu Tyr Thr Arg Thr Ser Gin Glu 
705 710 715 720 

ATC CAA ATG AAA AGG ACA GCT ATT GAA GCA TTT AAT GAA ACC ATA AAA 22 08 

He Gin Met Lys Arg Thr Ala He Glu Ala Phe Asn Glu Thr He Lys 
725 730 735 

ATA TTT GAA GAA CAG TGC CAG ACC CAA GAG CGG TAC AGC AAA GAA TAC 22 56 

He Phe Glu Glu Gin Cys Gin Thr Gin Glu Arg Tyr Ser Lys Glu Tyr 
740 745 750 

ATA GAA AAG TTT AAA CGT GAA. GGC AAT GAG AAA GAA ATA CAA AGG ATT 2304 
He Glu Lys Phe Lys Arg Glu Gly Asn Glu Lys Glu He Gin Arg He 
755 760 765 

ATG CAT AAT TAT GAT AAG TTG AAG TCT CGA ATC AGT GAA ATT ATT GAC 2 3 52 
Met His Asn Tyr Asp Lys Leu Lys Ser Arg He Ser Glu He lie Asp 
770 775 780 



AGT AGA AGA AGA TTG GAA GAA GAC TTG AAG AAG CAG GCA GCT GAG TAT 2400 
Ser Arg Arg Arg Leu Glu Glu Asp Leu Lys Lys Gin Ala Ala Glu Tyr 
785 790 795 800 

CGA GAA ATT GAC AAA CGT ATG AAC AGC ATT AAA CCA GAC CTT ATC CAG 2448 
Arg Glu lie Asp Lys Arg Met Asn Ser He Lys Pro Asp Leu He Gin 
805 810 815 

CTG AGA AAG ACG AGA GAC CAA TAC TTG ATG TGG TTG ACT CAA AAA GGT 2496 
Leu Arg Lys Thr Arg Asp Gin Tyr Leu Met Trp Leu Thr Gin Lys Gly 
820 825 830 

GTT CGG CAA AAG AAG TTG AAC GAG TGG TTG GGC AAT GAA AAC ACT GAA 2544 
Val Arg Gin Lys Lys Leu Asn Glu Trp Leu Gly Asn Glu Asn Thr Glu 
835 840 845 

GAC CAA TAT TCA CTG GTG GAA GAT GAT GAA GAT TTG CCC CAT CAT GAT 2592 
Asp Gin Tyr Ser Leu Val Glu Asp Asp Glu Asp Leu Pro His His Asp 
850 855 860 

GAG AAG ACA TGG AAT GTT GGA AGC AGC AAC CGA AAC AAA GCT GAA AAC 2640 
Glu Lys Thr Trp Asn Val Gly Ser Ser Asn Arg Asn Lys Ala Glu Asn 
865 870 875 880 

CTG TTG CGA GGG AAG CGA GAT GGC ACT TTT CTT GTC CGG GAG AGC AGT 2688 
Leu Leu Arg Gly Lys Arg Asp Gly Thr Phe Leu Val Arg Glu Ser Ser 
885 890 895 

AAA CAG GGC TGC TAT GCC TGC TCT GTA GTG GTG GAC GGC GAA GTA AAG 2736 
Lys Gin Gly Cys Tyr Ala Cys Ser Val Val Val Asp Gly Glu Val Lys 
900 905 910 

CAT TGT GTC ATA AAC AAA ACA GCA ACT GGC TAT GGC TTT GCC GAG CCC 27 84 
His Cys Val He Asn Lys Thr Ala Thr Gly Tyr Gly Phe Ala Glu Pro 
915 920 925 

TAT AAC TTG TAC AGC TCT CTG AAA GAA CTG GTG CTA CAT TAC CAA CAC 2832 
Tyr Asn Leu Tyr Ser Ser Leu Lys Glu Leu Val Leu His Tyr Gin His 
930 935 940 

ACC TCC CTT GTG CAG CAC AAC GAC TCC CTC AAT GTC ACA CTA GCC TAC 2880 
Thr Ser Leu Val Gin His Asn Asp Ser Leu Asn Val Thr Leu Ala Tyr 
945 950 955 960 

CCA GTA TAT GCA CAG CAG AGG CGA TGA 2 907 

Pro Val Tyr Ala Gin Gin Arg Arg 
965 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 968 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 

15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Met Ser Ala Glu Gly Tyr Gin Tyr Arg Ala Leu Tyr 

245 250 255 

Asp Tyr Lys Lys Glu Arg Glu Glu Asp lie Asp Leu His Leu Gly Asp 

260 265 270 

lie Leu Thr Val Asn Lys Gly Ser Leu Val Ala Leu Gly Phe Ser Asp 

275 280 285 

Gly Gin Glu Ala Arg Pro Glu Glu He Gly Trp Leu Asn Gly Tyr Asn 

290 295 300 

Glu Thr Thr Gly Glu Arg Gly Asp Phe Pro Gly Thr Tyr Val Glu Tyr 
305 310 315 320 

He Gly Arg Lys Lys lie Ser Pro Pro Thr Pro Lys Pro Arg Pro Pro 

325 330 335 

Arg Pro Leu Pro Val Ala Pro Gly Ser Ser Lys Thr Glu Ala Asp Val 

340 345 350 

Glu Gin Gin Ala Leu Thr Leu Pro Asp Leu Ala Glu Gin Phe Ala Pro 

355 360 365 

Pro Asp He Ala Pro Pro Leu Leu He Lys Leu Val Glu Ala He Glu 

370 375 380 

Lys Lys Gly Leu Glu Cys Ser Thr Leu Tyr Arg Thr Gin Ser Ser Ser 
385 390 395 400 

Asn Leu Ala Glu Leu Arg Gin Leu Leu Asp Cys Asp Thr Pro Ser Val 

405 410 415 

Asp Leu Glu Met lie Asp Val His Val Leu Ala Asp Ala Phe Lys Arg 
420 425 430 



Tyr Leu Leu Asp Leu Pro Asn Pro Val lie Pro Ala Ala Val Tyr Ser 

435 440 445 

Glu Met He Ser Leu Ala Pro Glu Val Gin Ser Ser Glu Glu Tyr He 

450 455 460 

Gin Leu Leu Lys Lys Leu He Arg Ser Pro Ser He Pro His Gin Tyr 
465 470 475 480 

Trp Leu Thr Leu Gin Tyr Leu Leu Lys His Phe Phe Lys Leu Ser Gin 

485 490 495 

Thr Ser Ser Lys Asn Leu Leu Asn Ala Arg Val Leu Ser Glu He Phe 

500 505 510 

Ser Pro Met Leu Phe Arg Phe Ser Ala Ala Ser Ser Asp Asn Thr Glu 

515 520 525 

Asn Leu He Lys Val He Glu He Leu He Ser Thr Glu Trp Asn Glu 

530 535 540 

Arg Gin Pro Ala Pro Ala Leu Pro Pro Lys Pro Pro Lys Pro Thr Thr 
545 550 555 560 

Val Ala Asn Asn Gly Met Asn Asn Asn Met Ser Leu Gin Asn Ala Glu 

565 570 575 

Trp Tyr Trp Gly Asp He Ser Arg Glu Glu Val Asn Glu Lys Leu Arg 

580 585 590 

Asp Thr Ala Asp Gly Thr Phe Leu Val Arg Asp Ala Ser Thr Lys Met 

595 600 605 

His Gly Asp Tyr Thr Leu Thr Leu Arg Lys Gly Gly Asn Asn Lys Leu 

610 615 620 

He Lys He Phe His Arg Asp Gly Lys Tyr Gly Phe Ser Asp Pro Leu 
625 630 635 640 

Thr Phe Ser Ser Val Val Glu Leu He Asn His Tyr Arg Asn Glu Ser 

645 650 655 

Leu Ala Gin T^-r Asn Pro Lys Leu Asp Val Lys Leu Leu Tyr Pro Val 

660 665 670 

Ser Lys Tyr Gin Gin Asp Gin Val Val Lys Glu Asp Asn He Glu Ala 

675 680 685 

Val Gly Lys Lys Leu His Glu Tyr Asn Thr Gin Phe Gin Glu Lys Ser 

690 695 700 

Aurg Glu Tyr Asp Arg Leu Tyr Glu Glu Tyr Thr Arg Thr Ser Gin Glu 
705 710 715 720 

He Gin Met Lys Arg Thr Ala He Glu Ala Phe Asn Glu Thr He Lys 

725 730 735 

He Phe Glu Glu Gin Cys Gin Thr Gin Glu Arg Tyr Ser Lys Glu Tyr 

740 745 750 

He Glu Lys Phe Lys Arg Glu Gly Asn Glu Lys Glu He Gin Arg He 

755 760 765 

Met His Asn Tyr Asp Lys Leu Lys Ser Arg He Ser Glu He He Asp 

770 775 780 

Ser Arg Arg Arg Leu Glu Glu Asp Leu Lys Lys Gin Ala Ala Glu Tyr 
785 790 795 800 

Arg Glu He Asp Lys Arg Met Asn Ser He Lys Pro Asp Leu lie Gin 

805 810 815 

Leu Arg Lys Thr Arg Asp Gin Tyr Leu Met Trp Leu Thr Gin Lys Gly 

820 825 830 

Val Arg Gin Lys Lys Leu Asn Glu Trp Leu Gly Asn Glu Asn Thr Glu 

835 840 845 

Asp Gin Tyr Ser Leu Val Glu Asp Asp Glu Asp Leu Pro His His Asp 

850 855 860 

Glu Lys Thr Trp Asn Val Gly Ser Ser Asn Arg Asn Lys Ala Glu Asn 
865 870 875 880 

Leu Leu Arg Gly Lys Arg Asp Gly Thr Phe Leu Val Arg Glu Ser Ser 
885 890 895 



9/ 



Lys Gin Gly Cys Tyr Ala Cys Ser Val Val Val Asp Gly Glu Val Lys 

900 905 910 

His Cys Val lie Asn Lys Thr Ala Thr Gly Tyr Gly Phe Ala Glu Pro 

915 920 925 

Tyr Asn Leu Tyr Ser Ser Leu Lys Glu Leu Val Leu His Tyr Gin His 

930 935 940 

Thr Ser Leu Val Gin His Asn Asp Ser Leu Asn Val Thr Leu Ala Tyr 
945 950 955 960 

Pro Val Tyr Ala Gin Gin Arg Arg 
965 



(2) INFORMATION FOR SEQ ID NO: 50: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : cDNA 
<ix) FEATURE: 



(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 1...2157 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 



ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 48 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 



GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 



GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 

35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 



CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 2 40 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 



CAG CAC GAC TTC TTC AAG TCC GCC ATC CCC GAA GGC TAC GTC CAG GAG 2 88 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 



CC-C ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 336 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 



GT3 AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 



V 



115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 432 
lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 480 
Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 528 
Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 
165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 
180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 720 
Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT CGA GCT CAA GCT TCG AAT TCG ACC ATG TCG TCC ATC 7 68 

Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Thr Met Ser Ser lie 
245 250 255 

TTG CCA TTC ACG CCG CCA GTT GTG AAG AGA CTG CTG GGA TGG AAG AAG 816 
Leu Pro Phe Thr Pro Pro Val Val Lys Arg Leu Leu Gly Trp Lys Lys 
260 265 270 

TCA GCT GGT GGG TCT GGA GGA GCA GGC GGA GGA GAG CAG AAT GGG CAG 864 
Ser Ala Gly Gly Ser Gly Gly Ala Gly Gly Gly Glu Gin Asn Gly Gin 
275 280 285 

GAA GAA AAG TGG TGT GAG AAA GCA GTG AAA AGT CTG GTG AAG AAG CTA 912 
Glu Glu Lys Trp Cys Glu Lys Ala Val Lys Ser Leu Val Lys Lys Leu 
290 295 300 

AAG AAA ACA GGA CGA TTA GAT GAG CTT GAG AAA GCC ATC ACC ACT CAA 960 
Lys Lys Thr Gly Arg Leu Asp Glu Leu Glu Lys Ala lie Thr Thr Gin 
305 310 315 320 

AAC TGT AAT ACT AAA TGT GTT ACC ATA CCA AGC ACT TGC TCT GAA ATT 1008 
Asn Cys Asn Thr Lys Cys Val Thr lie Pro Ser Thr Cys Ser Glu lie 
325 330 335 

TGG GGA CTG AGT ACA CCA AAT ACG ATA GAT CAG TGG GAT ACA ACA GGC 1056 
Trp Gly Leu Ser Thr Pro Asn Thr lie Asp Gin Trp Asp Thr Thr Gly 
340 345 350 
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CTT TAC AGC TTC TCT GAA CAA ACC AGG TCT CTT GAT GGT CGT CTC CAG 1104 
Leu Tyr Ser Phe Ser Glu Gin Thr Arg Ser Leu Asp Gly Arg Leu Gin 
355 360 365 

GTA TCC CAT CGA AAA GGA TTG CCA CAT GTT ATA TAT TGC CGA TTA TGG 1152 
Val Ser His Arg Lys Gly Leu Pro His Val lie Tyr Cys Arg Leu Trp 
370 375 380 

CGC TGG CCT GAT CTT CAC AGT CAT CAT GAA CTC AAG GCA ATT GAA AAC 1200 
Arg Trp Pro Asp Leu His Ser His His Glu Leu Lys Ala lie Glu Asn 
385 390 395 400 

TGC GAA TAT GCT TTT AAT CTT AAA AAG GAT GAA GTA TGT GTA AAC CCT 1248 
Cys Glu Tyr Ala Phe Asn Leu Lys Lys Asp Glu Val Cys Val Asn Pro 
405 410 415 

TAC CAC TAT CAG AGA GTT GAG ACA CCA GTT TTG CCT CCA GTA TTA GTG 1296 
Tyr His Tyr Gin Arg Val Glu Thr Pro Val Leu Pro Pro Val Leu Val 
420 425 430 

CCC CGA CAC ACC GAG ATC CTA ACA GAA CTT CCG CCT CTG GAT GAC TAT 13 44 

Pro Arg His Thr Glu lie Leu Thr Glu Leu Pro Pro Leu Asp Asp Tyr 
435 440 445 

ACT CAC TCC ATT CCA GAA AAC ACT AAC TTC CCA GCA GGA ATT GAG CCA 13 92 

Thr His Ser lie Pro Glu Asn Thr Asn Phe Pro Ala Gly lie Glu Pro 
450 455 460 

CAG AGT AAT TAT ATT CCA GAA ACG CCA CCT CCT GGA TAT ATC AGT GAA 1440 
Gin Ser Asn Tyr lie Pro Glu Thr Pro Pro Pro Gly Tyr lie Ser Glu 
465 470 475 480 

GAT GGA GAA ACA AGT GAC CAA CAG TTG AAT CAA AGT ATG GAC ACA GGC 1488 
Asp Gly Glu Thr Ser Asp Gin Gin Leu Asn Gin Ser Met Asp Thr Gly 
485 490 495 

TCT CCA GCA GAA CTA TCT CCT ACT ACT CTT TCC CCT GTT AAT CAT AGC 1536 
Ser Pro Ala Glu Leu Ser Pro Thr Thr Leu Ser Pro Val Asn His Ser 
500 505 510 

TTG GAT TTA CAG CCA GTT ACT TAC TCA GAA CCT GCA TTT TGG TGT TCA 1584 
Leu Asp Leu Gin Pro Val Thr Tyr Ser Glu Pro Ala Phe Trp Cys Ser 
515 520 525 

ATA GCA TAT TAT GAA TTA AAT CAG AGG GTT GGA GAA ACC TTC CAT GCA 1632 
lie Ala Tyr Tyr Glu Leu Asn Gin Arg Val Gly Glu Thr Phe His Ala 
530 535 540 

TCA CAG CCC TCA CTC ACT GTA GAT GGC TTT ACA GAC CCA TCA AAT TCA 1680 
Ser Gin Pro Ser Leu Thr Val Asp Gly Phe Thr Asp Pro Ser Asn Ser 
545 550 555 560 

GAG AGG TTC TGC TTA G3T TTA CTC TCC AAT GTT AAC CGA AAT GCC ACG 17 28 

Glu Arg Phe Cys Leu Gly Leu Leu Ser Asn Val Asn Arg Asn Ala Thr 
565 570 575 

GTA GAA ATG ACA AGA AGG CAT ATA GGA AGA GGA GTG CGC TTA TAC TAC 177 6 

Val Glu Met Thr Arg Arg His He Gly Arg Gly Val Arg Leu Tyr Tyr 
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580 585 590 

ATA GGT GGG GAA GTT TTT GCT GAG TGC CTA AGT GAT AGT GCA ATC TTT 1824 

He Gly Gly Glu Val Phe Ala Glu Cys Leu Ser Asp Ser Ala He Phe 
595 600 605 

GTG CAG AGC CCC AAT TGT AAT CAG AGA TAT GGC TGG CAC CCT GCA ACA 1872 

Val Gin Ser Pro Asn Cys Asn Gin Arg Tyr Gly Trp His Pro Ala Thr 
610 615 620 

GTG TGT AAA ATT CCA CCA GGC TGT AAT CTG AAG ATC TTC AAC AAC CAG 1920 

Val Cys Lys He Pro Pro Gly Cys Asn Leu Lys He Phe Asn Asn Gin 
625 630 635 640 

GAA TTT GCT GCT CTT CTG GCT CAG TCT GTT AAT CAG GGT TTT GAA GCC 1968 

Glu Phe Ala Ala Leu Leu Ala Gin Ser Val Asn Gin Gly Phe Glu Ala 
645 650 655 

GTC TAT CAG CTA ACT AGA ATG TGC ACC ATA AGA ATG AGT TTT GTG AAA 2016 

Val Tyr Gin Leu Thr Arg Met Cys Thr He Arg Met Ser Phe Val Lys 

660 665 670 

GGG TGG GGA GCA GAA TAC CGA AGG CAG ACG GTA ACA AGT ACT CCT TGC 2064 

Gly Trp Gly Ala Glu Tyr Arg Arg Gin Thr Val Thr Ser Thr Pro Cys 
675 680 685 

TGG ATT GAA CTT CAT CTG AAT GGA CCT CTA CAG TGG TTG GAC AAA GTA 2112 

Trp He Glu Leu His Leu Asn Gly Pro Leu Gin Trp Leu Asp Lys Val 
690 695 700 

TTA ACT CAG ATG GGA TCC CCT TCA GTG CGT TGC TCA AGC ATG TCA TAA 2160 

Leu Thr Gin Met Gly Ser Pro Ser Val Arg Cys Ser Ser Met Ser 
705 710 715 



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 719 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 



Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 

115 120 125 

lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Thr Met Ser Ser He 

245 250 255 

Leu Pro Phe Thr Pro Pro Val Val Lys Arg Leu Leu Gly Trp Lys Lys 

260 265 270 

Ser Ala Gly Gly Ser Gly Gly Ala Gly Gly Gly Glu Gin Asn Gly Gin 

275 280 285 

Glu Glu Lys Trp Cys Glu Lys Ala Val Lys Ser Leu Val Lys Lys Leu 

290 295 300 

Lys Lys Thr Gly Arg Leu Asp Glu Leu Glu Lys Ala He Thr Thr Gin 
305 310 315 320 

Asn Cys Asn Thr Lys Cys Val Thr He Pro Ser Thr Cys Ser Glu He 

325 330 335 

Trp Gly Leu Ser Thr Pro Asn Thr lie Asp Gin Trp Asp Thr Thr Gly 

340 345 350 

Leu Tyr Ser Phe Ser Glu Gin Thr Arg Ser Leu Asp Gly Arg Leu Gin 

355 360 365 

Val Ser His Arg Lys Gly Leu Pro His Val lie Tyr Cys Arg Leu Trp 

370 375 380 

Arg Trp Pro Asp Leu His Ser His His Glu Leu Lys Ala lie Glu Asn 
385 390 395 400 

Cys Glu Tyr Ala Phe Asn Leu Lys Lys Asp Glu Val Cys Val Asn Pro 

405 410 415 

Tyr His Tyr Gin Arg Val Glu Thr Pro Val Leu Pro Pro Val Leu Val 

420 425 430 

Pro Arg His Thr Glu He Leu Thr Glu Leu Pro Pro Leu Asp Asp Tyr 

435 440 445 

Thr His Ser He Pro Glu Asn Thr Asn Phe Pro Ala Gly lie Glu Pro 

450 455 460 

Gin Ser Asn Tyr He Pro Glu Thr Pro Pro Pro Gly Tyr He Ser Glu 
465 470 475 480 

Asp Gly Glu Thr Ser Asp Gin Gin Leu Asn Gin Ser Met Asp Thr Gly 

485 490 495 

Ser Pro Ala Glu Leu Ser Pro Thr Thr Leu Ser Pro Val Asn His Ser 

500 505 510 

Leu Asp Leu Gin Pro Val Thr Tyr Ser Glu Pro Ala Phe Trp Cys Ser 

515 520 525 

He Ala Tyr Tyr Glu Leu Asn Gin Arg Val Gly Glu Thr Phe His Ala 
530 535 540 



Ser Gin Pro Ser 
545 

Glu Arg Phe Cys 

Val Glu Met Thr 
580 

He Gly Gly Glu 
595 

Val Gin Ser Pro 
610 

Val Cys Lys He 
625 

Glu Phe Ala Ala 

Val Tyr Gin Leu 
660 

Gly Trp Gly Ala 
675 

Trp He Glu Leu 
690 

Leu Thr Gin Met 
705 



Leu Thr Val Asp 
550 

Leu Gly Leu Leu 
565 

Arg Arg His He 

Val Phe Ala Glu 
600 

Asn Cys Asn Gin 
615 

Pro Pro Gly Cys 
630 

Leu Leu Ala Gin 
645 

Thr Arg Met Cys 

Glu Tyr Arg Arg 
680 

His Leu Asn Gly 
695 

Gly Ser Pro Ser 
710 



Gly Phe Thr Asp 

555 

Ser Asn Val Asn 
570 

Gly Arg Gly Val 
585 

Cys Leu Ser Asp 

Arg Tyr Gly Trp 
620 

Asn Leu Lys He 
635 

Ser Val Asn Gin 
650 

Thr He Arg Met 
665 

Gin Thr Val Thr 

Pro Leu Gin Trp 
700 

Val Arg Cys Ser 
715 



Pro Ser Asn Ser 

560 

Arg Asn Ala Thr 
575 

Arg Leu Tyr Tyr 
590 

Ser Ala He Phe 
605 

His Pro Ala Thr 

Phe Asn Asn Gin 
640 

Gly Phe Glu Ala 
655 

Ser Phe Val Lys 
670 

Ser Thr Pro Cys 
685 

Leu Asp Lys Val 
Ser Met Ser 



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2421 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 



(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 1 . . . 2418 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 



ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 48 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 



GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 



GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 



TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 



CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 240 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 



65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 2 88 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 336 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 HO 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 4 80 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 57 6 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 720 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT CGA GCT CAA GCT TCG AAT TCG AAT TCA ACC ATG GAC 7 68 

Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Asn Ser Thr Met Asp 
245 250 255 

AAT ATG TCT ATT ACG AAT ACA CCA ACA AGT AAT GAT GCC TGT CTG AGC 816 
Asn Met Ser He Thr Asn Thr Pro Thr Ser Asn Asp Ala Cys Leu Ser 
260 265 270 

ATT GTG CAT AGT TTG ATG TGC CAT AGA CAA GGT GGA GAG AGT GAA ACA 8 64 

He Val His Ser Leu Met Cys His Arg Gin Gly Gly Glu Ser Glu Thr 
275 280 285 

TTT GCA AAA AGA GCA ATT GAA AGT TTG GTA AAG AAG CTG AAG GAG AAA 912 
Phe Ala Lys Arg Ala He Glu Ser Leu Val Lys Lys Leu Lys Glu Lys 
29C 295 300 



AAA GAT GAA TTG GAT TCT TTA ATA ACA GCT ATA ACT AC A AAT GGA GCT 960 
Lys Asp Glu Leu Asp Ser Leu lie Thr Ala He Thr Thr Asn Gly Ala 
305 310 315 320 

CAT CCT AGT AAA TGT GTT ACC ATA CAG AGA ACA TTG GAT GGG AGG CTT 1008 
His Pro Ser Lys Cys Val Thr He Gin Arg Thr Leu Asp Gly Arg Leu 
325 330 335 

CAG GTG GCT GGT CGG AAA GGA TTT CCT CAT GTG ATC TAT GCC CGT CTC 1056 
Gin Val Ala Gly Arg Lys Gly Phe Pro His Val He Tyr Ala Arg Leu 
340 345 350 

TGG AGG TGG CCT GAT CTT CAC AAA AAT GAA CTA AAA CAT GTT AAA TAT 1104 
Trp Arg Trp Pro Asp Leu His Lys Asn Glu Leu Lys His Val Lys Tyr 
355 360 365 

TGT CAG TAT GCG TTT GAC TTA AAA TGT GAT AGT GTC TGT GTG AAT CCA 1152 
Cys Gin Tyr Ala Phe Asp Leu Lys Cys Asp Ser Val Cys Val Asn Pro 
370 375 380 

TAT CAC TAC GAA CGA GTT GTA TCA CCT GGA ATT GAT CTC TCA GGA TTA 1200 
Tyr His Tyr Glu Arg Val Val Ser Pro Gly He Asp Leu Ser Gly Leu 
385 390 395 400 

ACA CTG CAG AGT AAT GCT CCA TCA AGT ATG ATG GTG AAG GAT GAA TAT 1248 
Thr Leu Gin Ser Asn Ala Pro Ser Ser Met Met Val Lys Asp Glu Tyr 
405 410 415 

GTG CAT GAC TTT GAG GGA CAG CCA TCG TTG TCC ACT GAA GGA CAT TCA 1296 
Val His Asp Phe Glu Gly Gin Pro Ser Leu Ser Thr Glu Gly His Ser 
420 425 430 

ATT CAA ACC ATC CAG CAT CCA CCA AGT AAT CGT GCA TCG ACA GAG ACA 1344 
lie Gin Thr He Gin His Pro Pro Ser Asn Arg Ala Ser Thr Glu Thr 
435 440 445 

TAC AGC ACC CCA GCT CTG TTA GCC CCA TCT GAG TCT AAT GCT ACC AGC 13 92 
Tyr Ser Thr Pro Ala Leu Leu Ala Pro Ser Glu Ser Asn Ala Thr Ser 
450 455 460 

ACT GCC AAC TTT CCC AAC ATT CCT GTG GCT TCC ACA AGT CAG CCT GCC 1440 
Thr Ala Asn Phe Pro Asn He Pro Val Ala Ser Thr Ser Gin Pro Ala 
465 470 475 480 

AGT ATA CTG GGG GGC AGC CAT AGT GAA GGA CTG TTG CAG ATA GCA TCA 1488 
Ser lie Leu Gly Gly Ser His Ser Glu Gly Leu Leu Gin He Ala Ser 
485 490 495 

GGG CCT CAG CCA GGA CAG CAG CAG AAT GGA TTT ACT GGT CAG CCA GCT 1536 
Gly Pro Gin Pro Gly Gin Gin Gin Asn Gly Phe Thr Gly Gin Pro Ala 
500 505 510 

ACT TAC CAT CAT AAC AGC ACT ACC ACC TGG ACT GGA AGT AGG ACT GCA 1584 
Thr Tyr His His Asn Ser Thr Thr Thr Trp Thr Gly Ser Arg Thr Ala 
515 520 525 

CCA TAC ACA CCT AAT TTG CCT CAC CAC CAA AAC GGC CAT CTT CAG CAC 1632 
Pro Tyr Thr Pro Asn Leu Pro His His Gin Asn Gly His Leu Gin His 



530 535 540 

CAC CCG CCT ATG CCG CCC CAT CCC GGA CAT TAC TGG CCT GTT CAC AAT 16 80 

His Pro Pro Met Pro Pro His Pro Gly His Tyr Trp Pro Val His Asn 
545 550 555 560 

GAG CTT GCA TTC CAG CCT CCC ATT TCC AAT CAT CCT GCT CCT GAG TAT 1728 
Glu Leu Ala Phe Gin Pro Pro lie Ser Asn His Pro Ala Pro Glu Tyr 
565 570 575 

TGG TGT TCC ATT GCT TAC TTT GAA ATG GAT GTT CAG GTA GGA GAG ACA 1776 
Trp Cys Ser He Ala Tyr Phe Glu Met Asp Val Gin Val Gly Glu Thr 
580 585 590 

TTT AAG GTT CCT TCA AGC TGC CCT ATT GTT ACT GTT GAT GGA TAC GTG 1824 
Phe Lys Val Pro Ser Ser Cys Pro He Val Thr Val Asp Gly Tyr Val 
595 600 605 

GAC CCT TCT GGA GGA GAT CGC TTT TGT TTG GGT CAA CTC TCC AAT GTC 1872 
Asp Pro Ser Gly Gly Asp Arg Phe Cys Leu Gly Gin Leu Ser Asn Val 
610 615 620 

CAC AGG ACA GAA GCC ATT GAG AGA GCA AGG TTG CAC ATA GGC AAA GGT 1920 
His Arg Thr Glu Ala He Glu Arg Ala Arg Leu His He Gly Lys Gly 
625 630 635 640 

GTG CAG TTG GAA TGT AAA GGT GAA GGT GAT GTT TGG GTC AGG TGC CTT 1968 
Val Gin Leu Glu Cys Lys Gly Glu Gly Asp Val Trp Val Arg Cys Leu 
645 650 655 

AGT GAC CAC GCG GTC TTT GTA CAG AGT TAC TAC TTA GAC AGA GAA GCT 2016 
Ser Asp His Ala Val Phe Val Gin Ser Tyr Tyr Leu Asp Arg Glu Ala 
660 665 670 

GGG CGT GCA CCT GGA GAT GCT GTT CAT AAG ATC TAC CCA AGT GCA TAT 2064 
Gly Arg Ala Pro Gly Asp Ala Val His Lys He Tyr Pro Ser Ala Tyr 
675 680 685 

ATA AAG GTC TTT GAT TTG CGT CAG TGT CAT CGA CAG ATG CAG CAG CAG 2112 
He Lys Val Phe Asp Leu Arg Gin Cys His Arg Gin Met Gin Gin Gin 
690 695 700 

GCG GCT ACT GCA CAA GCT GCA GCA GCT GCC CAG GCA GCA GCC GTG GCA 2160 
Ala Ala Thr Ala Gin Ala Ala Ala Ala Ala Gin Ala Ala Ala Val Ala 
705 710 715 720 

GGA AAC ATC CCT GGC CCA GGA TCA GTA GGT GGA ATA GCT CCA GCT ATC 2 2 08 
Gly Asn lie Pro Gly Pro Gly Ser Val Gly Gly He Ala Pro Ala He 
725 730 735 

AGT CTG TCA GCT GCT GCT GGA ATT GGT GTT GAT GAC CTT CGT CGC TTA 22 56 
Ser Leu Ser Ala Ala Ala Gly He Gly Val Asp Asp Leu Arc Arg Leu 
740 745 750 

TGC ATA CTC AGG ATG AGT TTT GTG AAA GGC TGG GGA CCG GAT TAC CCA 2 3 04 
Cys He Leu Arg Met Ser Phe Val Lys Gly Trp Gly Pro Asp Tyr Pro 
755 760 765 



AGA CAG AGC ATC AAA GAA ACA CCT TGC TGG ATT GAA ATT CAC TTA CAC 2 3 52 
Arg Gin Ser lie Lys Glu Thr Pro Cys Trp lie Glu lie His Leu His 
770 775 780 

CGG GCC CTC CAG CTC CTA GAC GAA GTA CTT CAT ACC ATG CCG ATT GCA 2400 
Arg Ala Leu Gin Leu Leu Asp Glu Val Leu His Thr Met Pro lie Ala 
785 790 795 800 

GAC CCA CAA CCT TTA GAC TGA 2421 
Asp Pro Gin Pro Leu Asp 
805 



(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 

15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 

115 120 125 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 

130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Asn Ser Thr Met Asp 
245 250 255 



Asn Met Ser He Thr Asn Thr Pro Thr Ser Asn Asp Ala Cys Leu Ser 

260 265 270 

He Val His Ser Leu Met Cys His Arg Gin Gly Gly Glu Ser Glu Thr 

275 280 285 

Phe Ala Lys Arg Ala He Glu Ser Leu Val Lys Lys Leu Lys Glu Lys 

290 295 300 

Lys Asp Glu Leu Asp Ser Leu He Thr Ala lie Thr Thr Asn Gly Ala 
305 310 315 320 

His Pro Ser Lys Cys Val Thr lie Gin Arg Thr Leu Asp Gly Arg Leu 

325 330 335 

Gin Val Ala Gly Arg Lys Gly Phe Pro His Val He Tyr Ala Arg Leu 

340 345 350 

Trp Arg Trp Pro Asp Leu His Lys Asn Glu Leu Lys His Val Lys Tyr 

355 360 365 

Cys Gin Tyr Ala Phe Asp Leu Lys Cys Asp Ser Val Cys Val Asn Pro 

370 375 380 

Tyr His Tyr Glu Arg Val Val Ser Pro Gly lie Asp Leu Ser Gly Leu 
385 390 395 400 

Thr Leu Gin Ser Asn Ala Pro Ser Ser Met Met Val Lys Asp Glu Tyr 

405 410 415 

Val His Asp Phe Glu Gly Gin Pro Ser Leu Ser Thr Glu Gly His Ser 

420 425 430 

He Gin Thr lie Gin His Pro Pro Ser Asn Arg Ala Ser Thr Glu Thr 

435 440 445 

Tyr Ser Thr Pro Ala Leu Leu Ala Pro Ser Glu Ser Asn Ala Thr Ser 

450 455 460 

Thr Ala Asn Phe Pro Asn He Pro Val Ala Ser Thr Ser Gin Pro Ala 
465 470 475 480 

Ser lie Leu Gly Gly Ser His Ser Glu Gly Leu Leu Gin He Ala Ser 

485 490 495 

Gly Pro Gin Pro Gly Gin Gin Gin Asn Gly Phe Thr Gly Gin Pro Ala 

500 505 510 

Thr Tyr His His Asn Ser Thr Thr Thr Trp Thr Gly Ser Arg Thr Ala 

515 520 525 

Pro Tyr Thr Pro Asn Leu Pro His His Gin Asn Gly His Leu Gin His 

530 535 540 

His Pro Pro Met Pro Pro His Pro Gly His Tyr Trp Pro Val His Asn 
545 550 555 560 

Glu Leu Ala Phe Gin Pro Pro lie Ser Asn His Pro Ala Pro Glu Tyr 

565 570 575 

Trp Cys Ser lie Ala Tyr Phe Glu Met Asp Val Gin Val Gly Glu Thr 

580 585 590 

Phe Lys Val Pro Ser Ser Cys Pro lie Val Thr Val Asp Gly Tyr Val 

595 600 605 

Asp Pro Ser Gly Gly Asp Arg Phe Cys Leu Gly Gin Leu Ser Asn Val 

610 615 620 

His Arg Thr Glu Ala lie Glu Arg Ala Arg Leu His lie Gly Lys Gly 
625 630 635 640 

Val Gin Leu Glu Cys Lys Gly Glu Gly Asp Val Trp Val Arg Cys Leu 

645 650 655 

Ser Asp His Ala Val Phe Val Gin Ser Tyr Tyr Leu Asp Arg Glu Ala 

660 665 670 

Gly Arg Ala Pro Gly Asp Ala Val His Lys He Tyr Pro Ser Ala Tyr 

675 680 685 

lie Lys Val Phe Asp Leu Arg Gin Cys His Arg Gin Met Gin Gin Gin 

690 695 700 

Ala Ala Thr Ala Gin Ala Ala Ala Ala Ala Gin Ala Ala Ala Val Ala 
705 710 715 720 



Gly Asn lie Pro Gly Pro Gly Ser Val Gly Gly He Ala Pro Ala He 

725 730 735 

Ser Leu Ser Ala Ala Ala Gly He Gly Val Asp Asp Leu Arg Arg Leu 

740 745 750 

Cys He Leu Arg Met Ser Phe Val Lys Gly Trp Gly Pro Asp Tyr Pro 

755 760 765 

Arg Gin Ser He Lys Glu Thr Pro Cys Trp lie Glu He His Leu His 

770 775 780 

Arg Ala Leu Gin Leu Leu Asp Glu Val Leu His Thr Met Pro He Ala 
785 790 795 800 

Asp Pro Gin Pro Leu Asp 
805 



(2) INFORMATION FOR SEQ ID NO: 54: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3120 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...3117 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 



ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 4 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 



GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 



GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 



TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 



192 



CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 240 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 



CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 2 88 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 



CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 336 
Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 



GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 4 80 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 720 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT ACC ATG GCG GGC TGG ATC CAG GCC CAG CAG CTG CAG 7 68 

Gly Leu Arg Ser Thr Met Ala Gly Trp lie Gin Ala Gin Gin Leu Gin 
245 250 255 

GGA GAC GCG CTG CGC CAG ATG CAG GTG CTG TAC GGC CAG CAC TTC CCC 816 
Gly Asp Ala Leu Arg Gin Met Gin Val Leu Tyr Gly Gin His Phe Pro 
260 265 270 

ATC GAG GTC CGG CAC TAC TTG GCC CAG TGG ATT GAG AGC CAG CCA TGG 864 
He Glu Val Arg His Tyr Leu Ala Gin Trp lie Glu Ser Gin Pro Trp 
275 280 285 

GAT OZC ATT GAC TTG GAC AAT CCC CAG GAC AGA GCC CAA GCC ACC CAG 912 
Asp Ala He Asp Leu Asp Asn Fro Gin Asp Arg Ala Gin Ala Thr Gin 
290 295 300 

CTC CTG GAG GGC CTG GTG CAG GAG CTG CAG AAG AAG GCG GAG CAC CAG 960 
Leu Leu Glu Gly Leu Val Gin Glu Leu Gin Lys Lys Ala Glu His Gin 
305 310 315 320 

GTG GGG GAA GAT GGG TTT TTA CTG AAG ATC AAG CTG GGG CAC TAC GCC 1008 
Val Gly Glu Asp Gly Phe Leu Leu Lys He Lys Leu Gly His Tyr Ala 
325 330 335 

ACG CAG CTC CAG AAA AC A TAT GAC CGC TGC CCC CTG GAG CTG GTC CGC 1056 
Thr Gin Leu Gin Lys Thr Tyr Asp Arg Cys Pro Leu Glu Leu Val Arg 



340 



345 



350 



TGC ATC CGG CAC ATT CTG TAC AAT GAA CAG AGG CTG GTC CGA GAA GCC 1104 
Cys lie Arg His lie Leu Tyr Asn Glu Gin Arg Leu Val Arg Glu Ala 
355 360 365 

AAC AAT TGC AGC TCT CCG GCT GGG ATC CTG GTT GAC GCC ATG TCC CAG 1152 
Asn Asn Cys Ser Ser Pro Ala Gly He Leu Val Asp Ala Met Ser Gin 
370 375 380 

AAG CAC CTT CAG ATC AAC CAG ACA TTT GAG GAG CTG CGA CTG GTC ACG 12 00 
Lys His Leu Gin He Asn Gin Thr Phe Glu Glu Leu Arg Leu Val Thr 
385 390 395 400 

CAG GAC ACA GAG AAT GAG CTG AAG AAA CTG CAG CAG ACT CAG GAG TAC 1248 
Gin Asp Thr Glu Asn Glu Leu Lys Lys Leu Gin Gin Thr Gin Glu Tyr 
405 410 415 

TTC ATC ATC CAG TAC CAG GAG AGC CTG AGG ATC CAA GCT CAG TTT GCC 1296 
Phe He He Gin Tyr Gin Glu Ser Leu Arg He Gin Ala Gin Phe Ala 
420 425 430 

CAG CTG GCC CAG CTG AGC CCC CAG GAG CGT CTG AGC CGG GAG ACG GCC 1344 
Gin Leu Ala Gin Leu Ser Pro Gin Glu Arg Leu Ser Arg Glu Thr Ala 
435 440 445 

CTC CAG CAG AAG CAG GTG TCT CTG GAG GCC TGG TTG CAG CGT GAG GCA 1392 
Leu Gin Gin Lys Gin Val Ser Leu Glu Ala Trp Leu Gin Arg Glu Ala 
450 455 460 

CAG ACA CTG CAG CAG TAC CGC GTG GAG CTG GCC GAG AAG CAC CAG AAG 1440 
Gin Thr Leu Gin Gin Tyr Arg Val Glu Leu Ala Glu Lys His Gin Lys 
465 470 475 480 

ACC CTG CAG CTG CTG CGG AAG CAG CAG ACC ATC ATC CTG GAT GAC GAG 1488 
Thr Leu Gin Leu Leu Arg Lys Gin Gin Thr He He Leu Asp Asp Glu 
485 490 495 

CTG ATC CAG TGG AAG CGG CGG CAG CAG CTG GCC GGG AAC GGC GGG CCC 153 6 

Leu He Gin Trp Lys Arg Arg Gin Gin Leu Ala Gly Asn Gly Gly Pro 
500 505 510 

CCC GAG GGC AGC CTG GAC GTG CTA CAG TCC TGG TGT GAG AAG TTG GCC 1584 
Pro Glu Gly Ser Leu Asp Val Leu Gin Ser Trp Cys Glu Lys Leu Ala 
515 520 525 

GAG ATC ATC TGG CAG AAC CGG CAG CAG ATC CGC AGG GCT GAG CAC CTC 1632 
Glu He He Trp Gin Asn Arg Gin Gin He Arg Arg Ala Glu His Leu 
530 535 540 

TGC CAG CAG CTG CCC ATC CCC GGC CCA GTG GAG GAG ATG CTG GCC GAG 1680 
Cys Gin Gin Leu Pro He Pro Gly Pro Val Glu Glu Met Leu Ala Glu 
545 550 555 560 

GTC AAC GCC ACC ATC ACG GAC ATT ATC TCA GCC CTG GTG ACC AGC ACA 1728 
Val Asn Ala Thr He Thr Asp lie lie Ser Ala Leu Val Thr Ser Thr 
565 570 575 



TTC ATC ATT GAG AAG CAG CCT CCT CAG GTC CTG AAG ACC CAG ACC AAG 177 6 

Phe lie lie Glu Lys Gin Pro Pro Gin Val Leu Lys Thr Gin Thr Lys 
580 585 590 

TTT GCA GCC ACC GTA CGC CTG CTG GTG GGC GGG AAG CTG AAC GTG CAC 1824 
Phe Ala Ala Thr Val Arg Leu Leu Val Gly Gly Lys Leu Asn Val His 
595 600 605 

ATG AAT CCC CCC CAG GTG AAG GCC ACC ATC ATC AGT GAG CAG CAG GCC 1872 
Met Asn Pro Pro Gin Val Lys Ala Thr lie He Ser Glu Gin Gin Ala 
610 615 620 

AAG TCT CTG CTT AAA AAT GAG AAC ACC CGC AAC GAG TGC AGT GGT GAG 1920 
Lys Ser Leu Leu Lys Asn Glu Asn Thr Arg Asn Glu Cys Ser Gly Glu 
625 630 635 640 

ATC CTG AAC AAC TGC TGC GTG ATG GAG TAC CAC CAA GCC ACG GGC ACC 1968 
He Leu Asn Asn Cys Cys Val Met Glu Tyr His Gin Ala Thr Gly Thr 
645 650 655 

CTC AGT GCC CAC TTC AGG AAC ATG TCA CTG AAG AGG ATC AAG CGT GCT 2016 
Leu Ser Ala His Phe Arg Asn Met Ser Leu Lys Arg He Lys Arg Ala 
660 665 670 

GAC CGG CGG GGT GCA GAG TCC GTG ACA GAG GAG AAG TTC ACA GTC CTG 2064 
Asp Arg Arg Gly Ala Glu Ser Val Thr Glu Glu Lys Phe Thr Val Leu 
675 680 685 

TTT GAG TCT CAG TTC AGT GTT GGC AGC AAT GAG CTT GTG TTC CAG GTG 2112 
Phe Glu Ser Gin Phe Ser Val Gly Ser Asn Glu Leu Val Phe Gin Val 
690 695 700 

AAG ACT CTG TCC CTA CCT GTG GTT GTC ATC GTC CAC GGC AGC CAG GAC 2160 
Lys Thr Leu Ser Leu Pro Val Val Val He Val His Gly Ser Gin Asp 
705 710 715 720 

CAC AAT GCC ACG GCT ACT GTG CTG TGG GAC AAT GCC TTT GCT GAG CCG 2208 
His Asn Ala Thr Ala Thr Val Leu Trp Asp Asn Ala Phe Ala Glu Pro 
725 730 735 

GGC AGG GTG CCA TTT GCC GTG CCT GAC AAA GTG CTG TGG CCG CAG CTG 22 56 
Gly Arg Val Pro Phe Ala Val Pro Asp Lys Val Leu Trp Pro Gin Leu 
740 745 750 

TGT GAG GCG CTC AAC ATG AAA TTC AAG GCC GAA GTG CAG AGC AAC CGG 2 3 04 
Cys Glu Ala Leu Asn Met Lys Phe Lys Ala Glu Val Gin Ser Asn Arg 
755 760 765 

GGC CTG ACC AAG GAG AAC CTC GTG TTC CTG GCG CAG AAA CTG TTC AAC 2352 
Gly Leu Thr Lys Glu Asn Leu Val Phe Leu Ala Gin Lys Leu Phe Asn 
770 775 780 

AAC AGC AGC AGC CAC CTG GAG GAC TAC AGT GGC CTG TCC GTG TCC TGG 2400 
Asn Ser Ser Ser His Leu Glu Asp Tyr Ser Gly Leu Ser Val Ser Trp 
785 790 795 800 

TCC CAG TTC AAC AGG GAG AAC TTG CCG GGC TGG AAC TAC ACC TTC TGG 2448 
Ser Gin Phe Asn Arg Glu Asn Leu Pro Gly Trp Asn Tyr Thr Phe Trp 



3 & 



805 810 815 

CAG TGG TTT GAC GGG GTG ATG GAG GTG TTG AAG AAG CAC CAC AAG CCC 2496 
Gin Trp Phe Asp Gly Val Met Glu Val Leu Lys Lys His His Lys Pro 
820 825 830 

CAC TGG AAT GAT GGG GCC ATC CTA GGT TTT GTG AAT AAG CAA CAG GCC 2 544 
His Trp Asn Asp Gly Ala lie Leu Gly Phe Val Asn Lys Gin Gin Ala 
835 840 845 

CAC GAC CTG CTC ATC AAC AAG CCC GAC GGG ACC TTC TTG TTG CGC TTT 2 592 
His Asp Leu Leu lie Asn Lys Pro Asp Gly Thr Phe Leu Leu Arg Phe 
850 855 860 

AGT GAC TCA GAA ATC GGG GGC ATC ACC ATC GCC TGG AAG TTT GAC TCC 2640 
Ser Asp Ser Glu lie Gly Gly lie Thr lie Ala Trp Lys Phe Asp Ser 
865 870 875 880 

CCG GAA CGC AAC CTG TGG AAC CTG AAA CCA TTC ACC ACG CGG GAT TTC 2688 
Pro Glu Arg Asn Leu Trp Asn Leu Lys Pro Phe Thr Thr Arg Asp Phe 
885 890 895 

TCC ATC AGG TCC CTG GCT GAC CGG CTG GGG GAC CTG AGC TAT CTC ATC 2736 
Ser lie Arg Ser Leu Ala Asp Arg Leu Gly Asp Leu Ser Tyr Leu lie 
900 905 910 

TAT GTG TTT CCT GAC CGC CCC AAG GAT GAG GTC TTC TCC AAG TAC TAC 2784 
Tyr Val Phe Pro Asp Arg Pro Lys Asp Glu Val Phe Ser Lys Tyr Tyr 
915 920 925 

ACT CCT GTG CTG GCT AAA GCT GTT GAT GGA TAT GTG AAA CCA CAG ATC 2 832 
Thr Pro Val Leu Ala Lys Ala Val Asp Gly Tyr Val Lys Pro Gin lie 
930 935 940 

AAG CAA GTG GTC CCT GAG TTT GTG AAT GCA TCT GCA GAT GCT GGG GGC 2 880 
Lys Gin Val Val Pro Glu Phe Val Asn Ala Ser Ala Asp Ala Gly Gly 
945 950 955 960 

AGC AGC GCC ACG TAC ATG GAC CAG GCC CCC TCC CCA GCT GTG TGC CCC 2928 
Ser Ser Ala Thr Tyr Met Asp Gin Ala Pro Ser Pro Ala Val Cys Pro 
965 970 975 

CAG GCT CCC TAT AAC ATG TAC CCA CAG AAC CCT GAC CAT GTA CTC GAT 2 976 
Gin Ala Pro Tyr Asn Met Tyr Pro Gin Asn Pro Asp His Val Leu Asp 
980 985 990 

CAG GAT GGA GAA TTC GAC CTG GAT GAG ACC ATG GAT GTG GCC AGG CAC 3 024 
Gin Asp Gly Glu Phe Asp Leu Asp Glu Thr Met Asp Val Ala Arg His 
995 1000 1005 

GTG GAG GAA CTC TTA CGC CGA CCA ATG GAC AGT CTT GAC TCC CGC CTC 3 072 
Val Glu Glu Leu Leu Arg Arg Pro Met Asp Ser Leu Asp Ser Arg Leu 
1010 1015 1020 

TCG CCC CCT GCC GGT CTT TTC ACC TCT GCC AGA GGC TCC CTC TCA TGA 3120 
Ser Pro Pro Ala Gly Leu Phe Thr Ser Ala Arg Gly Ser Leu Ser 
1025 1030 1035 1 



(2) INFORMATION FOR SEQ ID NO: 55 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1039 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 



Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 

15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 

115 120 125 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 

130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Thr Met Ala Gly Trp He Gin Ala Gin Gin Leu Gin 

245 250 255 

Gly Asp Ala Leu Arg Gin Met Gin Val Leu Tyr Gly Gin His Phe Pro 

260 265 270 

He Glu Val Arg His Tyr Leu Ala Gin Trp He Glu Ser Gin Pro Trp 

275 280 285 

Asp Ala He Asp Leu Asp Asn Pro Gin Asp Arg Ala Gin Ala Thr Gin 

290 295 300 

Leu Leu Glu Gly Leu Val Gin Glu Leu Gin Lys Lys Ala Glu His Gin 
305 310 315 320 

Val Gly Glu Asp Gly Phe Leu Leu Lys He Lys Leu Gly His Tyr Ala 

325 330 335 

Thr Gin Leu Gin Lys Thr Tyr Asp Arg Cys Pro Leu Glu Leu Val Arg 
340 345 350 



Cys He Arg His He Leu Tyr Asn Glu Gin Arg Leu Val Arg Glu Ala 

355 360 365 

Asn Asn Cys Ser Ser Pro Ala Gly He Leu Val Asp Ala Met Ser Gin 

370 375 380 

Lys His Leu Gin He Asn Gin Thr Phe Glu Glu Leu Arg Leu Val Thr 
385 390 395 400 

Gin Asp Thr Glu Asn Glu Leu Lys Lys Leu Gin Gin Thr Gin Glu Tyr 

405 410 415 

Phe He He Gin Tyr Gin Glu Ser Leu Arg He Gin Ala Gin Phe Ala 

420 425 430 

Gin Leu Ala Gin Leu Ser Pro Gin Glu Arg Leu Ser Arg Glu Thr Ala 

435 440 445 

Leu Gin Gin Lys Gin Val Ser Leu Glu Ala Trp Leu Gin Arg Glu Ala 

450 455 460 

Gin Thr Leu Gin Gin Tyr Arg Val Glu Leu Ala Glu Lys His Gin Lys 
465 470 475 480 

Thr Leu Gin Leu Leu Arg Lys Gin Gin Thr He He Leu Asp Asp Glu 

485 490 495 

Leu He Gin Trp Lys Arg Arg Gin Gin Leu Ala Gly Asn Gly Gly Pro 

500 505 510 

Pro Glu Gly Ser Leu Asp Val Leu Gin Ser Trp Cys Glu Lys Leu Ala 

515 520 525 

Glu He He Trp Gin Asn Arg Gin Gin He Arg Arg Ala Glu His Leu 

530 535 540 

Cys Gin Gin Leu Pro He Pro Gly Pro Val Glu Glu Met Leu Ala Glu 
545 550 555 560 

Val Asn Ala Thr He, Thr Asp He He Ser Ala Leu Val Thr Ser Thr 

565 570 575 

Phe He He Glu Lys Gin Pro Pro Gin Val Leu Lys Thr Gin Thr Lys 

580 585 590 

Phe Ala Ala Thr Val Arg Leu Leu Val Gly Gly Lys Leu Asn Val His 

595 600 605 

Met Asn Pro Pro Gin Val Lys Ala Thr He He Ser Glu Gin Gin Ala 

610 615 620 

Lys Ser Leu Leu Lys Asn Glu Asn Thr Arg Asn Glu Cys Ser Gly Glu 
625 630 635 640 

He Leu Asn Asn Cys Cys Val Met Glu Tyr His Gin Ala Thr Gly Thr 

645 650 655 

Leu Ser Ala His Phe Arg Asn Met Ser Leu Lys Arg He Lys Arg Ala 

660 665 670 

Asp Arg Arg Gly Ala Glu Ser Val Thr Glu Glu Lys Phe Thr Val Leu 

675 680 685 

Phe Glu Ser Gin Phe Ser Val Gly Ser Asn Glu Leu Val Phe Gin Val 

690 695 700 

Lys Thr Leu Ser Leu Pro Val Val Val He Val His Gly Ser Gin Asp 
705 710 715 720 

His Asn Ala Thr Ala Thr Val Leu Trp Asp Asn Ala Phe Ala Glu Pro 

725 730 735 

Gly Arg Val Pro Phe Ala Val Pro Asp Lys Val Leu Trp Pro Gin Leu 

740 745 750 

Cys Glu Ala Leu Asn Met Lys Phe Lys Ala Glu Val Gin Ser Asn Arg 

755 760 765 

Gly Leu Thr Lys Glu Asn Leu Val Phe Leu Ala Gin Lys Leu Phe Asn 

770 775 780 

Asn Ser Ser Ser His Leu Glu Asp Tyr Ser Gly Leu Ser Val Ser Trp 
785 790 795 800 

Ser Gin Phe Asn Arg Glu Asn Leu Pro Gly Trp Asn Tyr Thr Phe Trp 
805 810 815 
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Gin Trp Phe Asp Gly Val Met Glu Val Leu Lys Lys His His Lys Pro 

820 825 830 

His Trp Asn Asp Gly Ala lie Leu Gly Phe Val Asn Lys Gin Gin Ala 

835 840 845 

His Asp Leu Leu He Asn Lys Pro Asp Gly Thr Phe Leu Leu Arg Phe 

850 855 860 

Ser Asp Ser Glu He Gly Gly He Thr He Ala Trp Lys Phe Asp Ser 
865 870 875 880 

Pro Glu Arg Asn Leu Trp Asn Leu Lys Pro Phe Thr Thr Arg Asp Phe 

885 890 895 

Ser He Arg Ser Leu Ala Asp Arg Leu Gly Asp Leu Ser Tyr Leu He 

900 905 910 

Tyr Val Phe Pro Asp Arg Pro Lys Asp Glu Val Phe Ser Lys Tyr Tyr 

915 920 925 

Thr Pro Val Leu Ala Lys Ala Val Asp Gly Tyr Val Lys Pro Gin He 

930 935 940 

Lys Gin Val Val Pro Glu Phe Val Asn Ala Ser Ala Asp Ala Gly Gly 
945 950 955 960 

Ser Ser Ala Thr Tyr Met Asp Gin Ala Pro Ser Pro Ala Val Cys Pro 

965 970 975 

Gin Ala Pro Tyr Asn Met Tyr Pro Gin Asn Pro Asp His Val Leu Asp 

980 985 990 

Gin Asp Gly Glu Phe Asp Leu Asp Glu Thr Met Asp Val Ala Arg His 

995 1000 1005 

Val Glu Glu Leu Leu Arg Arg Pro Met Asp Ser Leu Asp Ser Arg Leu 

1010 1015 1020 

Ser Pro Pro Ala Gly Leu Phe Thr Ser Ala Arg Gly Ser Leu Ser 
025 1030 1035 1 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1875 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 1 . . . 1872 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

ATG GCG GCG GCG GCG GCG GCT CCG GGG GGC GGG GGC GGG GAG CCC AGG 48 
Met Ala Ala Ala Ala Ala Ala Pro Gly Gly Gly Gly Gly Glu Pro Arg 
15 10 15 

GGA ACT GCT GGG GTC GTC CCG GTG GTC CCC GGG GAG GTG GAG GTG GTG 96 
Gly Thr Ala Gly Val Val Pro Val Val Pro Gly Glu Val Glu Val Val 
20 25 30 



AAG GGG CAG CCA TTC GAT GTG GGC CCA CGC TAC ACG CAG CTG CAG TAC 
Lys Gly Gin Pro Phe Asp Val Gly Pro Arg Tyr Thr Gin Leu Gin Tyr 
35 4G 45 



144 
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ATC GGC GAG GGC GCG TAC GGC ATG GTC AGC TCA GCT TAT GAC CAC GTG 192 

He Gly Glu Gly Ala Tyr Gly Met Val Ser Ser Ala Tyr Asp His Val 
50 55 60 

CGC AAG ACC AGA GTG GCC ATC AAG AAG ATC AGC CCC TTT GAG CAT CAA 24 0 

Arg Lys Thr Arg Val Ala He Lys Lys He Ser Pro Phe Glu His Gin 
65 70 75 80 

ACC TAC TGT CAG CGC ACG CTG AGG GAG ATC CAG ATC TTG CTG CGA TTC 2 88 

Thr Tyr Cys Gin Arg Thr Leu Arg Glu He Gin He Leu Leu Arg Phe 

85 90 95 

CGC CAT GAG AAT GTT ATA GGC ATC CGA GAC ATC CTC AGA GCG CCC ACC 3 36 

Arg His Glu Asn Val He Gly He Arg Asp He Leu Arg Ala Pro Thr 
100 105 110 

CTG GAA GCC ATG AGA GAT GTT TAC ATT GTT CAG GAC CTC ATG GAG ACA 3 84 

Leu Glu Ala Met Arg Asp Val Tyr lie Val Gin Asp Leu Met Glu Thr 
115 120 125 

GAC CTG TAC AAG CTG CTT AAA AGC CAG CAG CTG AGC AAT GAC CAC ATC 432 

Asp Leu Tyr Lys Leu Leu Lys Ser Gin Gin Leu Ser Asn Asp His He 
130 135 140 

TGC TAC TTC CTC TAC CAG ATC CTC CGG GGC CTC AAG TAT ATA CAC TCA 480 

Cys Tyr Phe Leu Tyr Gin lie Leu Arg Gly Leu Lys Tyr He His Ser 
145 150 155 160 

GCC AAT GTG CTG CAC CGG GAC CTG AAG CCT TCC AAT CTG CTT ATC AAC 528 

Ala Asn Val Leu His Arg Asp Leu Lys Pro Ser A.sn Leu Leu He Asn 

165 170 175 

ACC ACC TGC GAC CTT AAG ATC TGT GAT TTT GGC CTG GCC CGG ATT GCT 576 

Thr Thr Cys Asp Leu Lys lie Cys Asp Phe Gly Leu Ala Arg lie Ala 
180 185 190 

GAC CCT GAG CAC GAC CAC ACT GGC TTT CTG ACG GAG TAT GTG GCC ACA 624 

Asp Pro Glu His Asp His Thr Gly Phe Leu Thr Glu Tyr Val Ala Thr 
195 200 205 

CGC TGG TAC CGA GCC CCA GAG ATC ATG CTT AAT TCC AAG GGC TAC ACC 672 

Arg Trp Tyr Arg Ala Pro Glu lie Met Leu Asn Ser Lys Gly Tyr Thr 
210 215 220 

AAA TCC ATC GAC ATC TGG TCT GTG GGC TGC ATT CTG GCT GAG ATG CTC 720 

Lys Ser He Asp He Trp Ser Val Gly Cys lie Leu Ala Glu Met Leu 
225 230 235 240 

TCC AAC CGG CCC ATC TTC CCC GGC AAG CAC TAC CTG GAC CAG CTC AAC 7 68 

Ser Asn Arg Pro lie Phe Pro Gly Lys His Tyr Leu Asp Gin Leu Asn 

245 250 255 

CAC ATT CTA GGT ATC TTG GGT TCC CCA TCC CAG GAG GAC CTT AAT TGC 816 

His lie Leu Gly lie Leu Gly Ser Pro Ser Gin Glu Asp Leu Asn Cys 
260 265 270 

ATC ATT AAC ATG AAG GCC CGA AAC TAC CTG CAG TCT CTG CCC TCG AAA 864 

He He Asn Met Lys Ala Arg Asn Tyr Leu Gin Ser Leu Pro Ser Lys 
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275 280 285 

ACC AAG GTG GCT TGG GCC AAG CTC TTT CCT AAA TCT GAC TCC AAA GCT 912 
Thr Lys Val Ala Trp Ala Lys Leu Phe Pro Lys Ser Asp Ser Lys Ala 
290 295 300 

CTT GAC CTG CTG GAC CGG ATG TTA ACC TTC AAC CCA AAC AAG CGC ATC 960 
Leu Asp Leu Leu Asp Arg Met Leu Thr Phe Asn Pro Asn Lys Arg lie 
305 310 315 320 

AC A GTA GAG GAA GCG CTG GCT CAC CCT TAC CTG GAA CAG TAC TAC GAT 1008 
Thr Val Glu Glu Ala Leu Ala His Pro Tyr Leu Glu Gin Tyr Tyr Asp 
325 330 335 

CCG ACA GAT GAG CCA GTG GCC GAG GAG CCA TTC ACC TTC GAC ATG GAG 1056 
Pro Thr Asp Glu Pro Val Ala Glu Glu Pro Phe Thr Phe Asp Met Glu 
340 345 350 

CTG GAT GAC CTC CCC AAG GAG CGG CTG AAG GAG TTG ATC TTC CAG GAG 1104 
Leu Asp Asp Leu Pro Lys Glu Arg Leu Lys Glu Leu lie Phe Gin Glu 
355 360 365 

ACA GCC CGC TTC CAG CCA GGG GCG CCA GAG GGC CCC GGG CGC GCC ATG 1152 
Thr Ala Arg Phe Gin Pro Gly Ala Pro Glu Gly Pro Gly Arg Ala Met 
370 375 380 

AGT AAA GGA GAA GAA CTT TTC ACT GGA GTT GTC CCA ATT CTT GTT GAA 1200 
Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val Glu 
385 390 395 400 

TTA GAT GGC GAT GTT AAT GGG CAA AAA TTC TCT GTT AGT GGA GAG GGT 1248 
Leu Asp Gly Asp Val Asn Gly Gin Lys Phe Ser Val Ser Gly Glu 'Gly 
405 410 415 

GAA GGT GAT GCA ACA TAC GGA AAA CTT ACC CTT AAA TTT ATT TGC ACT 1296 
Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr 
420 425 430 

ACT GGG AAG CTA CCT GTT CCA TGG CCA ACG CTT GTC ACT ACT CTC ACT 13 44 
Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr 
435 440 445 

TAT GGT GTT CAA TGC TTT TCT AGA TAC CCA GAT CAT ATG AAA CAG CAT 13 92 
Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His 
450 455 460 

GAC TTT TTC AAG AGT GCC ATG CCC GAA GGT TAT GTA CAG GAA AGA ACT 144 0 
Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr 
465 470 475 480 

ATA TTT TAC AAA GAT GAC GGG AAC TAC AAG ACA CGT GCT GAA GTC AAG 14 88 
He Phe Tyr Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys 
485 490 495 

TTT GAA GGT GAT ACC CTT GTT AAT AGA ATC GAG TTA AAA GGT ATT GAT 153 6 

Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He Asp 
500 505 510 



TTT AAA GAA GAT GGA AAC ATT CTT GGA CAC AAA ATG GAA TAC AAT TAT IS 84 

Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Met Glu Tyr Asn Tyr 
515 520 525 

AAC TCA CAT AAT GTA TAC ATC ATG GCA GAC AAA CCA AAG AAT GGC ATC 1632 
Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Pro Lys Asn Gly He 
530 535 540 

AAA GTT AAC TTC AAA ATT AGA CAC AAC ATT AAA GAT GGA AGC GTT CAA 1680 
Lys Val Asn Phe Lys lie Arg His Asn He Lys Asp Gly Ser Val Gin 
545 550 555 560 

TTA GCA GAC CAT TAT CAA CAA AAT ACT CCA ATT GGC GAT GGC CCT GTC 1728 
Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val 
565 570 575 

CTT TTA CCA GAC AAC CAT TAC CTG TCC ACG CAA TCT GCC CTT TCC AAA 1776 
Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys 
580 585 590 

GAT CCC AAC GAA AAG AGA GAT CAC ATG ATC CTT CTT GAG TTT GTA ACA 1824 
Asp Pro Asn Glu Lys Arg Asp His Met He Leu Leu Glu Phe Val Thr 
595 600 605 

GCT GCT GGG ATT ACA CAT GGC ATG GAT GAA CTA TAC AAA CCT CAG GAG T 1873 
Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys Pro Gin Glu 
610 615 620 



AA 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 624 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
<v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Met Ala Ala Ala Ala Ala Ala Pro Gly Gly Gly Gly Gly Glu Pro Arg 

15 10 15 

Gly Thr Ala Gly Val Val Pro Val Val Pro Gly Glu Val Glu Val Val 

20 25 30 

Lys Gly Gin Pro Phe Asp Val Gly Pro Arg Tyr Thr Gin Leu Gin Tyr 

35 40 45 

He Gly Glu Gly Ala Tyr Gly Met Val Ser Ser Ala Tyr Asp His Val 

50 55 60 

Arg Lys Thr Arg Val Ala He Lys Lys He Ser Pro Phe Glu His Gin 
65 70 75 80 

Thr Tyr Cys Gin Arg Thr Leu Arg Glu He Gin He Leu Leu Arg Phe 

85 90 95 

Arg His Glu Asn Val He Gly He Arg Asp He Leu Arg Ala Pro Thr 

100 105 HO 

Leu Glu Ala Met Arg Asp Val Tyr He Val Gin Asp Leu Met Glu Thr 



1875 



115 

Asp Leu Tyr Lys 
130 

Cys Tyr Phe Leu 
145 

Ala Asn Val Leu 

Thr Thr Cys Asp 
180 

Asp Pro Glu His 
195 

Arg Trp Tyr Arg 
210 

Lys Ser He Asp 
225 

Ser Asn Arg Pro 

His He Leu Gly 
260 

He He Asn Met 
275 

Thr Lys Val Ala 
290 

Leu Asp Leu Leu 
305 

Thr Val Glu Glu 

Pro Thr Asp Glu 
340 

Leu Asp Asp Leu 
355 

Thr Ala Arg Phe 
370 

Ser Lys Gly Glu 
385 

Leu Asp Gly Asp 

Glu Gly Asp Ala 
420 

Thr Gly Lys Leu 
435 

Tyr Gly Val Gin 
450 

Asp Phe Phe Lys 
465 

He Phe Tyr Lys 

Phe Glu Gly Asp 
500 

Phe Lys Glu Asp 
515 

Asn Ser His Asn 
530 

Lys Val Asn Phe 
545 

Leu Ala Asp His 
Leu Leu Pro Asp 



120 

Leu Leu Lys Ser 
135 

Tyr Gin He Leu 
150 

His Arg Asp Leu 
165 

Leu Lys He Cys 

Asp His Thr Gly 
200 

Ala Pro Glu He 
215 

He Trp Ser Val 
230 

He Phe Pro Gly 
245 

He Leu Gly Ser 

Lys Ala Arg Asn 
280 

Trp Ala Lys Leu 
295 

Asp Arg Met Leu 
310 

Ala Leu Ala His 
325 

Pro Val Ala Glu 

Pro Lys Glu Arg 
360 

Gin Pro Gly Ala 
375 

Glu Leu Phe Thr 
390 

Val Asn Gly Gin 
405 

Thr Tyr Gly Lys 

Pro Val Pro Trp 
440 

Cys Phe Ser Arg 
455 

Ser Ala Met Pro 
470 

Asp Asp Gly Asn 
485 

Thr Leu Val Asn 

Gly Asn He Leu 
520 

Val Tyr He Met 
535 

Lys He Arg His 
550 

Tyr Gin Gin Asn 
565 

Asn His Tyr Leu 



Gin Gin Leu Ser 
140 

Arg Gly Leu Lys 
155 

Lys Pro Ser Asn 
170 

Asp Phe Gly Leu 
185 

Phe Leu Thr Glu 

Met Leu Asn Ser 
220 

Gly Cys He Leu 
235 

Lys His Tyr Leu 
250 

Pro Ser Gin Glu 
265 

Tyr Leu Gin Ser 

Phe Pro Lys Ser 
300 

Thr Phe Asn Pro 
315 

Pro Tyr Leu Glu 
330 

Glu Pro Phe Thr 
345 

Leu Lys Glu Leu 

Pro Glu Gly Pro 
380 

Gly Val Val Pro 
395 

Lys Phe Ser Val 
410 

Leu Thr Leu Lys 
425 

Pro Thr Leu Val 

Tyr Pro Asp His 
460 

Glu Gly Tyr Val 
475 

Tyr Lys Thr Arg 
490 

Arg He Glu Leu 
505 

Gly His Lys Met 

Ala Asp Lys Pro 
540 

Asn He Lys Asp 
555 

Thr Pro He Gly 
570 

Ser Thr Gin Ser 



125 

Asn Asp His He 

Tyr lie His Ser 
160 

Leu Leu He Asn 
175 

Ala Arg lie Ala 
190 

Tyr Val Ala Thr 
205 

Lys Gly Tyr Thr 

Ala Glu Met Leu 
240 

Asp Gin Leu Asn 
255 

Asp Leu Asn Cys 
270 

Leu Pro Ser Lys 
285 

Asp Ser Lys Ala 

Asn Lys Arg lie 
320 

Gin Tyr Tyr Asp 
335 

Phe Asp Met Glu 
350 

lie Phe Gin Glu 
365 

Gly Arg Ala Met 

lie Leu Val Glu 
400 

Ser Gly Glu Gly 
415 

Phe He Cys Thr 
430 

Thr Thr Leu Thr 
445 

Met Lys Gin His 

Gin Glu Arg Thr 
480 

Ala Glu Val Lys 
495 

Lys Gly lie Asp 
510 

Glu Tyr Asn Tyr 
525 

Lys Asn Gly lie 

Gly Ser Val Gin 
560 

Asp Gly Pro Val 
575 

Ala Leu Ser Lys 
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580 585 590 

Asp Pro Asn Glu Lys Arg Asp His Met He Leu Leu Glu Phe Val Thr 

595 600 605 

Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys Pro Gin Glu 
610 615 620 



(2) INFORMATION FOR SEQ ID NO: 58: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1815 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE : 



(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 1...1811 
(D) OTHER INFORMATION: 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 



ATG GCG GCG GCG GCG GCG GCG GGC CCG GAG ATG GTC CGC GGG CAG GTG 4 8 

Met Ala Ala Ala Ala Ala Ala Gly Pro Glu Met Val Arg Gly Gin Val 
15 10 15 



TTC GAC GTG GGG CCG CGC TAC ACT AAT CTC TCG TAC ATC GGA GAA GGC 96 
Phe Asp Val Gly Pro Arg Tyr Thr Asn Leu Ser Tyr He Gly Glu Gly 
20 25 30 



GCC TAC GGC ATG GTT TGT TCT GCT TAT GAT AAT CTC AAC AAA GTT CGA 144 
Ala Tyr Gly Met Val Cys Ser Ala Tyr Asp Asn Leu Asn Lys Val Arg 
35 40 45 



GTT GCT ATC AAG AAA ATC AGT CCT TTT GAG CAC CAG ACC TAC TGT CAG 192 
Val Ala He Lys Lys He Ser Pro Phe Glu His Gin Thr Tyr Cys Gin 
50 55 60 



AGA ACC CTG AGA GAG ATA AAA ATC CTA CTG CGC TTC AGA CAT GAG AAC 240 
Arg Thr Leu Arg Glu He Lys lie Leu Leu Arg Phe Arg His Glu Asn 
65 70 75 80 



ATC ATC GGC ATC AAT GAC ATC ATC CGG GCA CCA ACC ATT GAG CAG ATG 2 88 

He He Gly lie Asn Asp He lie Arg Ala Pro Thr He Glu Gin Met 

85 90 95 



AAA GAT GTA TAT ATA GTA CAG GAC CTC ATG GAG ACA GAT CTT TAC AAG 3 36 

Lys Asp Val Tyr lie Val Gin Asp Leu Met Glu Thr Asp Leu Tyr Lys 
100 105 110 



CTC TTG AAG ACA CAG CAC CTC AGC AAT GAT CAT ATC TGC TAT TTT CTT 3 84 

Leu Leu Lys Thr Gin His Leu Ser Asn Asp His He Cys Tyr Phe Leu 
115 120 125 



TAT CAG ATC CTG AGA GGA TTA AAG TAT ATA CAT TCA GCT AAT GTT CTG 
Tyr Gin He Leu Arg Gly Leu Lys Tyr He His Ser Ala Asn Val Leu 
130 135 140 



432 



CAC CGT GAC CTC AAG CCT TCC AAC CTC CTG CTG AAC ACC ACT TGT GAT 4 80 

His Arg Asp Leu Lys Pro Ser Asn Leu Leu Leu Asn Thr Thr Cys Asp 
145 150 155 160 

CTC AAG ATC TGT GAC TTT GGC CTT GCC CGT GTT GCA GAT CCA GAC CAT 52 8 

Leu Lys lie Cys Asp Phe Gly Leu Ala Arg Val Ala Asp Pro Asp His 
165 170 175 

GAT CAT ACA GGG TTC TTG ACA GAG TAT GTA GCC ACG CGT TGG TAC AGA 576 
Asp His Thr Gly Phe Leu Thr Glu Tyr Val Ala Thr Arg Trp Tyr Arg 
180 185 190 

GCT CCA GAA ATT ATG TTG AAT TCC AAG GGT TAT ACC AAG TCC ATT GAT 624 
Ala Pro Glu lie Met Leu Asn Ser Lys Gly Tyr Thr Lys Ser lie Asp 
195 200 205 

ATT TGG TCT GTG GGC TGC ATC CTG GCA GAG ATG CTA TCC AAC AGG CCT 672 
lie Trp Ser Val Gly Cys lie Leu Ala Glu Met Leu Ser Asn Arg Pro 
210 215 220 

ATC TTC CCA GGA AAG CAT TAC CTT GAC CAG CTG AAT CAC ATC CTG GGT 7 20 

lie Phe Pro Gly Lys His Tyr Leu Asp Gin Leu Asn His lie Leu Gly 
225 230 235 240 

ATT CTT GGA TCT CCA TCA CAG GAA GAT CTG AAT TGT ATA ATA AAT TTA 768 
lie Leu Gly Ser Pro Ser Gin Glu Asp Leu Asn Cys lie lie Asn Leu 
245 250 255 

AAA GCT AGA AAC TAT TTG CTT TCT CTC CCG CAC AAA AAT AAG GTG CCG 816 
Lys Ala Arg Asn Tyr Leu Leu Ser Leu Pro His Lys Asn Lys Val Pro 
260 265 270 

TGG AAC AGG TTG TTC CCA AAC GCT GAC TCC AAA GCT CTG GAT TTA CTG 864 
Trp Asn Arg Leu Phe Pro Asn Ala Asp Ser Lys Ala Leu Asp Leu Leu 
275 280 285 

GAT AAA ATG TTG ACA TTT AAC CCT CAC AAG AGG ATT GAA GTT GAA CAG 912 
Asp Lys Met Leu Thr Phe Asn Pro His Lys Arg lie Glu Val Glu Gin 
290 295 300 

GCT CTG GCC CAC CCG TAC CTG GAG CAG TAT TAT GAC CCA AGT GAT GAG 960 
Ala Leu Ala His Pro Tyr Leu Glu Gin Tyr Tyr Asp Pro Ser Asp Glu 
305 310 315 320 

CCC ATT GCT GAA GCA CCA TTC AAG TTT GAC ATG GAG CTG GAC GAC TTA 1008 
Pro lie Ala Glu Ala Pro Phe Lys Phe Asp Met Glu Leu Asp Asp Leu 
325 330 335 

CCT AAG GAG AAG CTC AAA GAA CTC ATT TTT GAA GAG ACT GCT CGA TTC 1056 
Pro Lys Glu Lys Leu Lys Glu Leu lie Phe Glu Glu Thr Ala Arg Phe 
340 345 350 

CAG CCA GGA TAC AGA TCT ATG GAT CCA CCG GTC GCC ACC ATG GTG AGC 1104 
Gin Pre Gly Tyr Arg Ser Met Asp Pro Pro Val Ala Thr Met Val Ser 
355 360 365 

AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG GTC GAG CTG 1152 



Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val Glu Leu 

370 375 380 

GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC GAG GGC GAG 1200 

Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu 

3S5 390 395 400 

G3C GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC TGC ACC ACC 1248 

Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys Thr Thr 

405 410 415 

GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC CTG ACC TAC 1296 

Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr 

420 425 430 

GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG CAG CAC GAC 1344 

Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp 

435 440 445 

TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG CGC ACC ATC 13 92 

Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr He 

450 455 460 

TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG GTG AAG TTC 1440 

Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe 

465 470 475 480 

GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC ATC GAC TTC 1488 

Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He Asp Phe 

485 490 495 

AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC AAC TAC AAC 1536 

Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn 

500 505 510 

AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC GGC ATC AAG 1584 

Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly lie Lys 

515 520 525 

GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC GTG CAG CTC 1632 

Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin Leu 

530 535 540 

GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC CCC GTG CTG 1680 

Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val Leu 

545 550 555 560 

CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG AGC AAA GAC 1728 

Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp 

565 570 575 

CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC GTG ACC GCC 1776 

Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala 

580 585 590 

GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AA GTAA 1815 

Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 

595 600 
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(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 604 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Met Ala Ala Ala Ala Ala Ala Gly Pro Glu Met Val Arg Gly Gin Val 

15 10 15 

Phe Asp Val Gly Pro Arg Tyr Thr Asn Leu Ser Tyr lie Gly Glu Gly 

20 25 30 

Ala Tyr Gly Met Val Cys Ser Ala Tyr Asp Asn Leu Asn Lys Val Arg 

35 40 45 

Val Ala He Lys Lys He Ser Pro Phe Glu His Gin Thr Tyr Cys Gin 

50 55 60 

Arg Thr Leu Arg Glu He Lys He Leu Leu Arg Phe Arg His Glu Asn 
65 70 75 80 

lie He Gly He Asn Asp He He Arg Ala Pro Thr He Glu Gin Met 

85 90 95 

Lys Asp Val Tyr He Val Gin Asp Leu Met Glu Thr Asp Leu Tyr Lys 

100 105 HO 

Leu Leu Lys Thr Gin His Leu Ser Asn Asp His He Cys Tyr Phe Leu 

115 120 125 

Tyr Gin He Leu Arg Gly Leu Lys Tyr He His Ser Ala Asn Val Leu 

130 135 140 

His Arg Asp Leu Lys Pro Ser Asn Leu Leu Leu Asn Thr Thr Cys Asp 
145 150 155 160 

Leu Lys He Cys Asp Phe Gly Leu Ala Arg Val Ala Asp Pro Asp His 

165 170 175 

Asp His Thr Gly Phe Leu Thr Glu Tyr Val Ala Thr Arg Trp Tyr Arg 

180 185 190 

Ala Pro Glu He Met Leu Asn Ser Lys Gly Tyr Thr Lys Ser He Asp 

195 200 205 

He Trp Ser Val Gly Cys He Leu Ala Glu Met Leu Ser Asn Arg Pro 

210 215 220 

He Phe Pro Gly Lys His Tyr Leu Asp Gin Leu Asn His He Leu Gly 
225 230 235 240 

He Leu Gly Ser Pro Ser Gin Glu Asp Leu Asn Cys lie lie Asn Leu 

245 250 255 

Lys Ala Arg Asn Tyr Leu Leu Ser Leu Pro His Lys Asn Lys Val Pro 

260 265 270 

Trp Asn Arg Leu Phe Pro Asn Ala Asp Ser Lys Ala Leu Asp Leu Leu 

275 280 285 

Asp Lys Met Leu Thr Phe Asn Pro His Lys Arg lie Glu Val Glu Gin 

290 295 300 

Ala Leu Ala His Pre Tyr Leu Glu Gin Tyr Tyr Asp Pro Ser Asp Glu 
305 310 315 320 

Pro lie Ala Glu Ala Pro Phe Lys Phe Asp Met Glu Leu Asp Asp Leu 

325 330 335 

Pro Lys Glu Lys Leu Lys Glu Leu lie Phe Glu Glu Thr Ala Arg Phe 
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340 345 350 

Gin Pro Gly Tyr Arg Ser Met Asp Pro Pro Val Ala Thr Met Val Ser 

355 360 365 

Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu Leu 

370 375 380 

Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu 
385 390 395 400 

Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr Thr 

405 410 415 

Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr 

420 425 430 

Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp 

435 440 445 

Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr He 

450 455 460 

Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe 
465 470 475 480 

Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He Asp Phe 

485 490 495 

Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn 

500 505 510 

Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn Gly He Lys 

515 520 525 

Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin Leu 

530 535 540 

Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val Leu 
545 550 555 560 

Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp 

565 570 575 

Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala 

580 585 590 

Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
595 600 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2511 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
fix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 1 . . .2508 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

ATG GAG CTG GAA AAC ATC GTG GCC AAC ACG GTC TTG CTG AAA GCC AGG 48 
Met Glu Leu Glu Asn lie Val Ala Asn Thr Val Leu Leu Lys Ala Arg 
15 10 15 



GAA GGG GGC GGA GGA AAG CGC AAA GGG AAA AGC AAG AAG TGG AAA GAA 
Glu Gly Gly Gly Gly Lys Arg Lys Gly Lys Ser Lys Lys Trp Lys Glu 
20 25 30 



96 
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ATC CTG AAG TTC CCT CAC ATT AGC CAG TGT GAA GAC CTC CGA AGG ACC 144 
He Leu Lys Phe Pro His He Ser Gin Cys Glu Asp Leu Arg Arg Thr 
35 40 45 

ATA GAC AGA GAT TAC TGC AGT TTA TGT GAC AAG CAG CCA ATC GGG AGG 192 
He Asp Arg Asp Tyr Cys Ser Leu Cys Asp Lys Gin Pro He Gly Arg 
50 55 60 

CTG CTT TTC CGG CAG TTT TGT GAA ACC AGG CCT GGG CTG GAG TGT TAC 240 
Leu Leu Phe Arg Gin Phe Cys Glu Thr Axg Pro Gly Leu Glu Cys Tyr 
65 70 75 80 

ATT CAG TTC CTG GAC TCC GTG GCA GAA TAT GAA GTT ACT CCA GAT GAA 2 88 

He Gin Phe Leu Asp Ser Val Ala Glu Tyr Glu Val Thr Pro Asp Glu 
85 90 95 

AAA CTG GGA GAG AAA GGG AAG GAA ATT ATG ACC AAG TAC CTC ACC CCA 336 
Lys Leu Gly Glu Lys Gly Lys Glu He Met Thr Lys Tyr Leu Thr Pro 
100 105 HO 

AAG TCC CCT GTT TTC ATA GCC CAA GTT GGC CAA GAC CTG GTC TCC CAG 384 
Lys Ser Pro Val Phe lie Ala Gin Val Gly Gin Asp Leu Val Ser Gin 
115 120 125 

ACG GAG GAG AAG CTC CTA CAG AAG CCG TGC AAA GAA CTC TTT TCT GCC 432 
Thr Glu Glu Lys Leu Leu Gin Lys Pro Cys Lys Glu Leu Phe Ser Ala 
130 135 140 

TGT GCA CAG TCT GTC CAC GAG TAC CTG AGG GGA GAA CCA TTC CAC GAA 480 
Cys Ala Gin Ser Val His Glu Tyr Leu Arg Gly Glu Pro Phe His Glu 
145 150 155 160 

TAT CTG GAC AGC ATG TTT TTT GAC CGC TTT CTC CAG TGG AAG TGG TTG 528 
Tyr Leu Asp Ser Met Phe Phe Asp Arg Phe Leu Gin Trp Lys Trp Leu 
165 170 175 

GAA AGG CAA CCG GTG ACC AAA AAC ACT TTC AGG CAG TAT CGA GTG CTA 576 
Glu Arg Gin Pro Val Thr Lys Asn Thr Phe Arg Gin Tyr Arg Val Leu 
180 185 190 

GGA AAA GGG GGC TTC GGG GAG GTC TGT GCC TGC CAG GTT CGG GCC ACG 624 
Gly Lys Gly Gly Phe Gly Glu Val Cys Ala Cys Gin Val Axg Ala Thr 
195 200 205 

GGT AAA ATG TAT GCC TGC AAG CGC TTG GAG AAG AAG AGG ATC AAA AAG 672 
Gly Lys Met Tyr Ala Cys Lys Arg Leu Glu Lys Lys Arg He Lys Lys 
210 215 220 

AGG AAA GGG GAG TCC ATG GCC CTC AAT GAG AAG CAG ATC CTC GAG AAG 720 
Arg Lys Gly Glu Ser Met Ala Leu Asn Glu Lys Gin He Leu Glu Lys 
225 230 235 240 



GTC AAC AGT CAG TTT GTG GTC AAC CTG GCC TAT GCC TAC GAG ACC AAG 

Val Asn Ser Gin Phe Val Val Asn Leu Ala Tyr Ala Tyr Glu Thr Lys 

245 25C 255 

GAT GCA CTG TGC TTG GTC CTG ACC ATC ATG AAT GGG GGT GAC CTG AAG 



768 



816 
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Asp Ala Leu Cys Leu Val Leu Thr lie Met Asn Gly Gly Asp Leu Lys 
260 265 270 

TTC CAC ATC TAG AAC ATG GGC AAC CCT GGC TTC GAG GAG GAG CGG GCC 864 

Phe His lie Tyr Asn Met Gly Asn Pro Gly Phe Glu Glu Glu Arg Ala 
275 280 285 

TTG TTT TAT GCG GCA GAG ATC CTC TGC GGC TTA GAA GAC CTC CAC CGT 912 

Leu Phe Tyr Ala Ala Glu lie Leu Cys Gly Leu Glu Asp Leu His Arg 
290 295 300 

GAG AAC ACC GTC TAC CGA GAT CTG AAA CCT GAA AAC ATC CTG TTA GAT 96 0 

Glu Asn Thr Val Tyr Arg Asp Leu Lys Pro Glu Asn lie Leu Leu Asp 

305 310 315 320 

GAT TAT GGC CAC ATT AGG ATC TCA GAC CTG GGC TTG GCT GTG AAG ATC 1008 

Asp Tyr Gly His lie Arg lie Ser Asp Leu Gly Leu Ala Val Lys lie 
325 330 335 

CCC GAG GGA GAC CTG ATC CGC GGC CGG GTG GGC ACT GTT GGC TAC ATG 105 6 

Pro Glu Gly Asp Leu lie Arg Gly Arg Val Gly Thr Val Gly Tyr Met 
340 345 350 

GCC CCC GAA GTC CTG AAC AAC CAG AGG TAC GGC CTG AGC CCC GAC TAC 1104 

Ala Pro Glu Val Leu Asn Asn Gin Arg Tyr Gly Leu Ser Pro Asp Tyr 
355 360 365 

TGG GGC CTT GGC TGC CTC ATC TAT GAG ATG ATC GAG GGC CAG TCG CCG 1152 

Trp Gly Leu Gly Cys Leu lie Tyr Glu Met lie Glu Gly Gin Ser Pro 
370 375 380 

TTC CGC GGC CGT AAG GAG AAG GTG AAG CGG GAG GAG GTG GAC CGC CGG 12 00 

Phe Arg Gly Arg Lys Glu Lys Val Lys Arg Glu Glu Val Asp Arg Arg 

385 390 395 400 

GTC CTG GAG ACG GAG GAG GTG TAC TCC CAC AAG TTC TCC GAG GAG GCC 12 4 8 

Val Leu Glu Thr Glu Glu Val Tyr Ser His Lys Phe Ser Glu Glu Ala 
405 410 415 

AAG TCC ATC TGC AAG ATG CTG CTC ACG AAA GAT GCG AAG CAG AGG CTG 12 9 6 

Lys Ser lie Cys Lys Met Leu Leu Thr Lys Asp Ala Lys Gin Arg Leu 
420 425 430 

GGC TGC CAG GAG GAG GGG GCT GCA GAG GTC AAG AGA CAC CCC TTC TTC 13 44 

Gly Cys Gin Glu Glu Gly Ala Ala Glu Val Lys Arg His Pro Phe Phe 
435 440 445 

AGG AAC ATG AAC TTC AAG CGC TTA GAA GCC GGG ATG TTG GAC CCT CCC 1392 

Arg Asn Met Asn Phe Lys Arg Leu Glu Ala Gly Met Leu Asp Pro Pro 
450 455 460 

TTC GTT CCA GAC CCC CGC GCT GTG TAC TGT AAG GAC GTG CTG GAC ATC 144 0 

Phe Val Pro Asp Pro Arg Ala Val Tyr Cys Lys Asp Val Leu Asp lie 

465 470 475 480 

GAG CAG TTC TCC ACT GTG AAG GGC GTC AAT CTG GAC CAC ACA GAC GAC 14 88 

Glu Gin Phe Ser Thr Val Lys Gly Val Asn Leu Asp His Thr Asp Asp 
485 490 495 
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GAC TTC TAC TCC AAG TTC TCC ACG GGC TCT GTG TCC ATC CCA TGG CAA 1536 
Asp Fhe Tyr Ser Lys Phe Ser Thr Gly Ser Val Ser He Pro Trp Gin 
500 505 510 

AAC GAG ATG ATA GAA ACA GAA TGC TTT AAG GAG CTG AAC GTG TTT GGA 1584 
Asn Glu Met He Glu Thr Glu Cys Phe Lys Glu Leu Asn Val Phe Gly 
515 520 525 

CCT AAT GGT ACC CTC CCG CCA GAT CTG AAC AGA AAC CAC CCT CCG GAA 1632 
Pro Asn Gly Thr Leu Pro Pro Asp Leu Asn Arg Asn His Pro Pro Glu 
530 535 540 

CCG CCC AAG AAA GGG CTG CTC GAG AGA CTC TTC AAG CGG CAG CAT CAG 1680 
Pro Pro Lys Lys Gly Leu Leu Gin Arg Leu Phe Lys Arg Gin His Gin 
545 550 555 560 

AAC AAT TCC AAG AGT TCG CCC AGC TCC AAG ACC AGT TTT AAC CAC CAC 172 8 
Asn Asn Ser Lys Ser Ser Pro Ser Ser Lys Thr Ser Phe Asn His His 
565 570 575 

ATA AAC TCA AAC CAT GTC AGC TCG AAC TCC ACC GGA AGC AGC AGG GAT 1776 
He Asn Ser Asn His Val Ser Ser Asn Ser Thr Gly Ser Ser Arg Asp 
580 585 590 

CCA CCG GTC GCC ACC ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG 1824 
Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly 
595 600 605 

GTG GTG CCC ATC CTG GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG 1872 
Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys 
610 615 620 

TTC AGC GTG TCC GGC GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG 1920 
Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu 
625 630 635 640 

ACC CTC AAG TTC ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC 1968 
Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro 
645 650 655 

ACC CTC GTG ACC ACC CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC 2016 
Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr 
660 665 670 

CCC GAC CAC ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA 2064 
Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu 
675 680 685 

GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC 2112 
Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr 
690 695 700 

AAG ACC CGC GCC GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC 03-2 2160 
Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg 
705 710 715 720 

ATC GAG CTG AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG G3G 2208 
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lie Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly 
725 730 735 

CAC AAG CTG GAG TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC 2256 
His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala 
740 745 750 

GAC AAG CAG AAG AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC 2 3 04 
Asp Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn 
755 760 765 

ATC GAG GAC GGC AGC GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC 2352 
He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr 
770 775 780 

CCC ATC GGC GAC GGC CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC 2400 
Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser 
785 790 795 800 

ACC CAG TCC GCC CTG AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG 2448 
Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 
805 810 815 

GTC CTG CTG GAG TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC 2496 
Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp 
820 825 830 

GAG CTG TAC AAG TAA 2511 
Glu Leu Tyr Lys 
835 



(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Met Glu Leu Glu Asn He Val Ala Asn Thr Val Leu Leu Lys Ala Arg 

15 10 15 

Glu Gly Gly Gly Gly Lys Arg Lys Gly Lys Ser Lys Lys Trp Lys Glu 

20 25 30 

He Leu Lys Phe Pro His He Ser Gin Cys Glu Asp Leu Arg Arg Thr 

35 40 45 

He Asp Arg Asp Tyr Cys Ser Leu Cys Asp Lys Gin Pro He Gly Arg 

50 55 60 

Leu Leu Phe Arg Gin Phe Cys Glu Thr Arg Pro Gly Leu Glu Cys Tyr 

65 70 75 80 

He Gin Phe Leu Asp Ser Val Ala Glu Tyr Glu Val Thr Pro Asp Glu 

85 90 95 

Lys Leu Gly Glu Lys Gly Lys Glu He Met Thr Lys Tyr Leu Thr Pro 
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100 

Lys Ser Pro Val 
115 

Thr Glu Glu Lys 
130 

Cys Ala Gin Ser 
145 

Tyr Leu Asp Ser 

Glu Arg Gin Pro 
180 

Gly Lys Gly Gly 
195 

Gly Lys Met Tyr 
210 

Arg Lys Gly Glu 

225 

Val Asn Ser Gin 

Asp Ala Leu Cys 
260 

Phe His lie Tyr 
275 

Leu Phe Tyr Ala 
290 

Glu Asn Thr Val 

305 

Asp Tyr Gly His 

Pro Glu Gly Asp 
340 

Ala Pro Glu Val 
355 

Trp Gly Leu Gly 
370 

Phe Arg Gly Arg 
385 

Val Leu Glu Thr 

Lys Ser lie Cys 
420 

Gly Cys Gin Glu 
435 

Arg Asn Met Asn 
450 

Phe Val Pro Asp 
465 

Glu Gin Phe Ser 

Asp Phe Tyr Ser 
500 

Asn Glu Met lie 
515 

Pro Asn Gly Thr 
530 

Pro Pro Lys Lys 
545 

Asn Asn Ser Lys 



Phe lie Ala Gin 
120 

Leu Leu Gin Lys 
135 

Val His Glu Tyr 
150 

Met Phe Phe Asp 
165 

Val Thr Lys Asn 

Phe Gly Glu Val 
200 

Ala Cys Lys Arg 
215 

Ser Met Ala Leu 
230 

Phe Val Val Asn 
245 

Leu Val Leu Thr 

Asn Met Gly Asn 
280 

Ala Glu He Leu 
295 

Tyr Arg Asp Leu 
310 

He Arg He Ser 
325 

Leu He Arg Gly 

Leu Asn Asn Gin 
360 

Cys Leu He Tyr 
375 

Lys Glu Lys Val 
390 

Glu Glu Val Tyr 
405 

Lys Met Leu Leu 

Glu Gly Ala Ala 
440 

Phe Lys Arg Leu 
455 

Pro Arg Ala Val 
470 

Thr Val Lys Gly 
485 

Lys Phe Ser Thr 

Glu Thr Glu Cys 
520 

Leu Pro Pro Asp 
535 

Gly Leu Leu Gin 
550 

Ser Ser Pro Ser 



105 

Val Gly Gin Asp 

Pro Cys Lys Glu 
140 

Leu Arg Gly Glu 
155 

Arg Phe Leu Gin 
170 

Thr Phe Arg Gin 
185 

Cys Ala Cys Gin 

Leu Glu Lys Lys 
220 

Asn Glu Lys Gin 
235 

Leu Ala Tyr Ala 
250 

lie Met Asn Gly 
265 

Pro Gly Phe Glu 

Cys Gly Leu Glu 
300 

Lys Pro Glu Asn 
315 

Asp Leu Gly Leu 
330 

Arg Val Gly Thr 
345 

Arg Tyr Gly Leu 

Glu Met lie Glu 
380 

Lys Arg Glu Glu 
395 

Ser His Lys Phe 
410 

Thr Lys Asp Ala 
425 

Glu Val Lys Arg 

Glu Ala Gly Met 

460 

Tyr Cys Lys Asp 
475 

Val Asn Leu Asp 
490 

Gly Ser Val Ser 
505 

Phe Lys Glu Leu 

Leu Asn Arg Asn 
540 

Arg Leu Phe Lys 
555 

Ser Lys Thr Ser 



110 

Leu Val Ser Gin 
125 

Leu Phe Ser Ala 

Pro Phe His Glu 
160 

Trp Lys Trp Leu 
175 

Tyr Arg Val Leu 
190 

Val Arg Ala Thr 
205 

Arg He Lys Lys 

lie Leu Glu Lys 
240 

Tyr Glu Thr Lys 
255 

Gly Asp Leu Lys 
270 

Glu Glu Arg Ala 
285 

Asp Leu His Arg 

He Leu Leu Asp 
320 

Ala Val Lys He 
335 

Val Gly Tyr Met 
350 

Ser Pro Asp Tyr 
365 

Gly Gin Ser Pro 

Val Asp Arg Arg 
400 

Ser Glu Glu Ala 
415 

Lys Gin Arg Leu 
430 

His Pro Phe Phe 
445 

Leu Asp Pro Pro 

Val Leu Asp He 
480 

His Thr Asp Asp 
495 

lie Pro Trp Gin 

510 

Asn Val Phe Gly 
525 

His Pro Pro Glu 

Arg Gin His Gin 
560 

Phe Asn His His 
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565 570 575 

lie Asn Ser Asn His Val Ser Ser Asn Ser Thr Gly Ser Ser Arg Asp 

580 585 590 

Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly 

595 600 605 

Val Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys 

610 615 620 

Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu 
625 630 635 640 

Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro 

645 650 655 

Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr 

660 665 670 

Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu 

675 680 685 

Gly Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr 

690 695 700 

Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg 
705 710 715 720 

He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly 

725 730 735 

His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala 

740 745 750 

Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn 

755 760 765 

He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr 

770 775 780 

Pro lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser 
785 790 795 800 

Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 

805 810 815 

Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp 

820 825 830 

Glu Leu Tyr Lys 
835 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1893 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/ KEY: Coding Sequence 

(B) LOCATION: 1...1890 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 

ATG AGC AGA AGC AAG CGT GAC AAC AAT TTT TAT ACT GTA GAG ATT GGA 48 
Met Ser Arg Ser Lys Arg Asp Asn Asn Phe Tyr Ser Val Glu He Gly 
15 10 15 



GAT TCT ACA TTC ACA GTC CTG AAA CGA TAT CAG AAT TTA AAA CCT ATA 
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Asp Ser Thr Phe Thr Val Leu Lys Arc Tyr Gin Asn Leu Lys Pro He 
20 25 30 

GGC TCA GGA GCT CAA GGA ATA GTA TGC GCA GCT TAT GAT GCC ATT CTT 144 
Gly Ser Gly Ala Gin Gly He Val Cys Ala Ala Tyr Asp Ala He Leu 
35 40 45 

GAA AGA AAT GTT GCA ATC AAG AAG CTA AGC CGA CCA TTT CAG AAT CAG 192 
Glu Arg Asn Val Ala He Lys Lys Leu Ser Arg Pro Phe Gin Asn Gin 
50 55 60 

ACT CAT GCC AAG CGG GCC TAC AGA GAG CTA GTT CTT ATG AAA TGT GTT 240 
Thr His Ala Lys Arg Ala Tyr Arg Glu Leu Val Leu Met Lys Cys Val 
65 70 75 80 

AAT CAC AAA AAT ATA ATT GGC CTT TTG AAT GTT TTC ACA CCA CAG AAA 288 
Asn His Lys Asn He He Gly Leu Leu Asn Val Phe Thr Pro Gin Lys 
85 90 95 

TCC CTA GAA GAA TTT CAA GAT GTT TAC ATA GTC ATG GAG CTC ATG GAT 336 
Ser Leu Glu Glu Phe Gin Asp Val Tyr lie Val Met Glu Leu Met Asp 
100 105 110 

GCA AAT CTT TGC CAA GTG ATT CAG ATG GAG CTA GAT CAT GAA AGA ATG 3 84 

Ala Asn Leu Cys Gin Val He Gin Met Glu Leu Asp His Glu Arg Met 
115 120 125 

TCC TAC CTT CTC TAT CAG ATG CTG TGT GGA ATC AAG CAC CTT CAT TCT 432 
Ser Tyr Leu Leu Tyr Gin Met Leu Cys Gly He Lys His Leu His Ser 
130 135 140 

GCT GGA ATT ATT CAT CGG GAC TTA AAG CCC AGT AAT ATA GTA GTA AAA 4 80 

Ala Gly He He His Arg Asp Leu Lys Pro Ser Asn He Val Val Lys 
145 150 155 160 

TCT GAT TGC ACT TTG AAG ATT CTT GAC TTC GGT CTG GCC AGG ACT GCA 52 8 

Ser Asp Cys Thr Leu Lys lie Leu Asp Phe Gly Leu Ala Arg Thr Ala 
165 170 175 

GGA ACG AGT TTT ATG ATG ACG CCT TAT GTA GTG ACT CGC TAC TAC AGA 57 6 

Gly Thr Ser Phe Met Met Thr Pro Tyr Val Val Thr Arg Tyr Tyr Arg 
180 185 190 

GCA CCC GAG GTC ATC CTT GGC ATG GGC TAC AAG GAA AAC GTG GAT TTA 62 4 

Ala Pro Glu Val He Leu Gly Met Gly Tyr Lys Glu Asn Val Asp Leu 
195 200 205 

TGG TCT GTG GGG TGC ATT ATG GGA GAA ATG GTT TGC CAC AAA ATC CTC 672 
Trp Ser Val Gly Cys He Met Gly Glu Met Val Cys His Lys He Leu 
210 215 220 

TTT CCA GGA AGG GAC TAT ATT GAT CAG TGG AAT AAA GTT ATT GAA CAG 72 0 

Phe Pro Gly Arg Asp Tyr lie Asp Gin Trp Asn Lys Val lie Glu Gin 
225 230 235 240 

CTT GGA ACA CCA TGT CCT GAA TTC AT3 AAG AAA CTG CAA CCA ACA GTA 7 68 

Leu Gly Thr Pro Cys Pro Glu Phe Met Lys Lys Leu Gin Pro Thr Val 
245 250 255 



AGG ACT TAC GTT GAA AAC AGA CCT AAA TAT GCT GGA TAT AGC TTT GAG 816 
Arg Thr Tyr Val Glu Asn Arg Pro Lys Tyr Ala Gly Tyr Ser Phe Glu 
260 265 270 

AAA CTC TTC CCT GAT GTC CTT TTC CCA GCT GAC TCA GAA CAC AAC AAA 864 
Lys Leu Phe Pro Asp Val Leu Phe Pro Ala Asp Ser Glu His Asn Lys 
275 280 285 

CTT AAA GCC AGT CAG GCA AGG GAT TTG TTA TCC AAA ATG CTG GTA ATA 912 
Leu Lys Ala Ser Gin Ala Arg Asp Leu Leu Ser Lys Met Leu Val lie 
290 295 300 

GAT GCA TCT AAA AGG ATC TCT GTA GAT GAA GCT CTC CAA CAC CCG TAC 960 
Asp Ala Ser Lys Arg He Ser Val Asp Glu Ala Leu Gin His Pro Tyr 
305 310 315 320 

ATC AAT GTC TGG TAT GAT CCT TCT GAA GCA GAA GCT CCA CCA CCA AAG 1008 
He Asn Val Trp Tyr Asp Pro Ser Glu Ala Glu Ala Pro Pro Pro Lys 
325 330 335 

ATC CCT GAC AAG CAG TTA GAT GAA AGG GAA CAC ACA ATA GAA GAG TGG 1056 
He Pro Asp Lys Gin Leu Asp Glu Arg Glu His Thr He Glu Glu Trp 
340 345 350 

AAA GAA TTG ATA TAT AAG GAA GTT ATG GAC TTG GAG GAG AGA ACC AAG 1104 
Lys Glu Leu He Tyr Lys Glu Val Met Asp Leu Glu Glu Arg Thr Lys 
355 360 365 

AAT GGA GTT ATA CGG GGG CAG CCC TCT CCT TTA GCA CAG GTG CAG CAG 1152 
Asn Gly Val He Arg Gly Gin Pro Ser Pro Leu Ala Gin Val Gin Gin 
370 375 380 

TGG GAT CCA CCG GTC GCC ACC ATG GTG AGC AAG GGC GAG GAG CTG TTC 1200 
Trp Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe 
385 390 395 400 

ACC GGG GTG GTG CCC ATC CTG GTC GAG CTG GAC GGC GAC GTA AAC GGC 124 8 
Thr Gly Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly 
405 410 415 

CAC AAG TTC AGC GTG TCC GGC GAG GGC GAG GGC GAT GCC ACC TAC GGC 12 96 
His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly 
420 425 430 

AAG CTG ACC CTG AAG TTC ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC 1344 
Lys Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro 
435 440 445 

TGG CCC ACC CTC GTG ACC ACC CTG ACC TAC GGC GTG CAG TGC TTC AGC 1392 
Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser 
450 455 460 

CGC TAC CCC GAC CAC ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG 1440 
Arg Tyr Pro A.sp His Met Lys Gin His A.sp Phe Phe Lys Ser Ala Met 
465 470 475 480 

CCC GAA GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC 14 88 
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Pro Glu Gly Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly 
485 490 495 

AAC TAC AAG ACC CGC GCC GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG 153 6 

Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val 
500 505 510 

AAC CGC ATC GAG CTG AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC 1584 
Asn Arg He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He 
515 520 525 

CTG GGG CAC AAG CTG GAG TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC 1632 
Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He 
530 535 540 

ATG GCC GAC AAG CAG AAG AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC 1680 
Met Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg 
545 550 555 560 

CAC AAC ATC GAG GAC GGC AGC GTG CAG CTC GCC GAC CAC TAC CAG CAG 1728 
His Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin 
565 570 575 

AAC ACC CCC ATC GGC GAC GGC CCC GTG CTG CTG CCC GAC AAC CAC TAC 177 6 

Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr 
580 585 590 

CTG AGC ACC CAG TCC GCC CTG AGC AAA GAC CCC AAC GAG AAG CGC GAT 1824 
Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp 
595 600 605 

CAC ATG GTC CTG CTG GAG TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC 1872 
His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thx Leu Gly 
610 615 620 

ATG GAC GAG CTG TAC AAG TAA 1893 
Met Asp Glu Leu Tyr Lys 
625 630 



(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Met Ser Arg Ser Lys Arg Asp Asn Asn Phe Tyr Ser Val Glu He Gly 

1 5 10 15 

Asp Ser Thr Phe Thr Val Leu Lys Arg Tyr Gin Asn Leu Lys Pro He 

20 25 30 

Gly Ser Gly Ala Gin Gly He Val Cys Ala Ala Tyr Asp Ala He Leu 
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35 40 45 

Glu Arg Asn Val Ala lie Lys Lys Leu Ser Arg Pro Phe Gin Asn Gin 

50 55 60 

Thr His Ala Lys Arg Ala Tyr Arg Glu Leu Val Leu Met Lys Cys Val 
65 70 75 80 

Asn His Lys Asn lie lie Gly Leu Leu Asn Val Phe Thr Pro Gin Lys 

85 90 95 

Ser Leu Glu Glu Phe Gin Asp Val Tyr lie Val Met Glu Leu Met Asp 

100 105 110 

Ala Asn Leu Cys Gin Val lie Gin Met Glu Leu Asp His Glu Arg Met 

115 120 125 

Ser Tyr Leu Leu Tyr Gin Met Leu Cys Gly lie Lys His Leu His Ser 

130 135 140 

Ala Gly lie lie His Arg Asp Leu Lys Pro Ser Asn lie Val Val Lys 
145 150 155 160 

Ser Asp Cys Thr Leu Lys lie Leu Asp Phe Gly Leu Ala Arg Thr Ala 

165 170 175 

Gly Thr Ser Phe Met Met Thr Pro Tyr Val Val Thr Arg Tyr Tyr Arg 

180 185 190 

Ala Pro Glu Val lie Leu Gly Met Gly Tyr Lys Glu Asn Val Asp Leu 

195 200 205 

Trp Ser Val Gly Cys lie Met Gly Glu Met Val Cys His Lys lie Leu 

210 215 220 

Phe Pro Gly Arg Asp Tyr lie Asp Gin Trp Asn Lys Val lie Glu Gin 
225 230 235 240 

Leu Gly Thr Pro Cys Pro Glu Phe Met Lys Lys Leu Gin Pro Thr Val 

245 250 255 

Arg Thr Tyr Val Glu Asn Arg Pro Lys Tyr Ala Gly Tyr Ser Phe Glu 

260 265 270 

Lys Leu Phe Pro Asp Val Leu Phe Pro Ala Asp Ser Glu His Asn Lys 

275 280 285 

Leu Lys Ala Ser Gin Ala Arg Asp Leu Leu Ser Lys Met Leu Val lie 

290 295 300 

Asp Ala Ser Lys Arg lie Ser Val Asp Glu Ala Leu Gin His Pro Tyr 
305 310 315 320 

lie Asn Val Trp Tyr Asp Pro Ser Glu Ala Glu Ala Pro Pro Pro Lys 

325 330 335 

lie Pro Asp Lys Gin Leu Asp Glu Arg Glu His Thr lie Glu Glu Trp 

340 345 350 

Lys Glu Leu lie Tyr Lys Glu Val Met Asp Leu Glu Glu Arg Thr Lys 

355 360 365 

Asn Gly Val lie Arg Gly Gin Pro Ser Pro Leu Ala Gin Val Gin Gin 

370 375 380 

Trp Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe 
385 390 395 400 

Thr Gly Val Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly 

405 410 415 

His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly 

420 425 430 

Lys Leu Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro 

435 440 445 

Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser 

450 455 460 

Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met 
465 470 475 480 

Pro Glu Gly Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly 

485 490 495 

Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val 



500 505 510 

Asn Arg lie Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He 

515 520 525 

Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He 

530 535 540 

Met Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg 
545 550 555 560 

His Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin 

565 570 575 

Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr 

580 585 590 

Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp 

595 600 605 

His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly 

610 615 620 

Met Asp Glu Leu Tyr Lys 
625 630 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1821 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(ix) FEATURE: 

(A) NAME /KEY: Coding Sequence 

(B) LOCATION: 1. . .1818 
(D) OTHER INFORMATION: 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

ATG TCT CAG GAG AGG CCC ACG TTC TAC CGG CAG GAG CTG AAC AAG ACA 48 
Met Ser Gin Glu Arg Pro Thr Phe Tyr Arg Gin Glu Leu Asn Lys Thr 
15 10 15 

ATC TGG GAG GTG CCC GAG CGT TAC CAG AAC CTG TCT CCA GTG GGC TCT 96 
He Trp Glu Val Pro Glu Arg Tyr Gin Asn Leu Ser Pro Val Gly Ser 
20 25 30 

G3C GCC TAT GGC TCT GTG TGT GCT GCT TTT GAC ACA AAA ACG GGG TTA 144 
Gly Ala Tyr Gly Ser Val Cys Ala Ala Phe Asp Thr Lys Thr Gly Leu 
35 40 45 

CGT GTG GCA GTG AAG AAG CTC TCC AGA CCA TTT CAG TCC ATC ATT CAT 192 
Arg Val Ala Val Lys Lys Leu Ser Arg Pro Phe Gin Ser He He His 
50 55 60 

GIG AAA AGA ACC TAC AGA GAA CTG CGG TTA CTT AAA CAT ATG AAA CAT 240 
Ala Lys Arg Thr Tyr Arg Glu Leu Arg Leu Leu Lys His Met Lys His 
65 " 70 75 80 

GAA AAT GTG ATT GGT CTG TTG GAC GTT TTT ACA CCT GCA AG3 TCT CTG 2 88 

Glu Asn Val He Gly Leu Leu Asp Val Phe Thr Pro Ala Arg Ser Leu 
85 90 95 



GAG GAA TTC 
Glu Glu Phe 



CTG AAC AAC 
Leu Asn Asn 
115 

TTC CTT ATC 
Phe Leu lie 
130 

GAC ATA ATT 
Asp lie lie 
145 

GAC TGT GAG 
Asp Cys Glu 

GAT GAA ATG 
Asp Glu Met 

ATC ATG CTG 
lie Met Leu 
195 

GTG GGA TGC 
Val Gly Cys 
210 

GGT ACA GAC 
Gly Thr Asp 
225 

ACC CCA GGG 
Thr Pro Gly 



AAC TAT ATT 
Asn Tyr lie 



GTA TTT ATT 
Val Phe lie 
275 

CTT GTA TTG 
Leu Val Leu 
290 

CAT GCC TAC 
His Ala Tyr 

305 

GAT CCT TAT 



AAT GAT GTG 
Asn Asp Val 
100 

ATT GTG AAA 
He Val Lys 



TAC CAA ATT 
Tyr Gin He 



CAC AGG GAC 
His Arg Asp 
150 

CTG AAG ATT 
Leu Lys He 
165 

ACA GGC TAC 
Thr Gly Tyr 
180 

AAC TGG ATG 
Asn Trp Met 



ATA ATG GCC 
He Met Ala 



CAT ATT GAT 
His He Asp 
230 

GCT GAG CTT 
Ala Glu Leu 
245 

CAG TCT TTG 
Gin Ser Leu 
260 

GGT GCC AAT 
Gly Ala Asn 

GAC TCA GAT 
Asp Ser Asp 

TTT GCT CAG 
Phe Ala Gin 
310 

GAT CAG TCC 



TAT CTG GTG 
Tyr Leu Val 
105 

TGT CAG AAG 
Cys Gin Lys 
120 

CTC CGA GGT 
Leu Arg Gly 
135 

CTA AAA CCT 
Leu Lys Pro 

CTG GAT TTT 
Leu Asp Phe 



GTG GCC ACT 
Val Ala Thr 
185 

CAT TAC AAC 
His Tyr Asn 
200 

GAG CTG TTG 
Glu Leu Leu 
215 

CAG TTG AAG 
Gin Leu Lys 

TTG AAG AAA 
Leu Lys Lys 

ACT CAG ATG 
Thr Gin Met 
265 

CCC CTG GCT 
Pro Leu Ala 
280 

AAG AGA ATT 
Lys Arg lie 
295 

TAC CAC GAT 
Tyr His Asp 

TTT GAA AGO 



ACC CAT CTC 
Thr His Leu 



CTT ACA GAT 
Leu Thr Asp 

CTA AAG TAT 
Leu Lys Tyr 
140 

AGT AAT CTA 
Ser Asn Leu 
155 

GGA CTG GCT 
Gly Leu Ala 
170 

AGG TGG TAC 
Arg Trp Tyr 



CAG ACA GTT 
Gin Thr Val 



ACT GGA AGA 
Thr Gly Arg 
220 

CTC ATT TTA 
Leu He Leu 
235 

ATC TCC TCA 
He Ser Ser 
250 

CCG AAG ATG 
Pro Lys Met 



GTC GAC TTG 
Val Asp Leu 



ACA GCG GCC 
Thr Ala Ala 
300 

CCT GAT GAT 
Pro Asp Asp 
315 

AGG GAC CTC 



ATG GGG GCA 
Met Gly Ala 
110 

GAC CAT GTT 
Asp His Val 
125 

ATA CAT TCA 
He His Ser 



GCT GTG AAT 
Ala Val Asn 



CGG CAC ACA 
Arg His Thr 
175 

AGG GCT CCT 
Arg Ala Pro 
190 

GAT ATT TGG 
Asp He Trp 

205 

ACA TTG TTT 
Thr Leu Phe 



AGA CTC GTT 
Arg Leu Val 



GAG TCT GCA 
Glu Ser Ala 
255 

AAC TTT GCG 
Asn Phe Ala 
270 

CTG GAG AAG 
Leu Glu Lys 
285 

CAA GCC CTT 
Gin Ala Leu 



GAA CCA GTG 
Glu Pro Val 



CTT ATA GAT 



GAT 3 3 6 

Asp 



CAG 384 
Gin 



GCT 432 
Ala 



GAA 480 

Glu 

160 

GAT 528 
Asp 



GAG 576 
Glu 



TCA 624 
Ser 



CCT 672 
Pro 



GGA 720 

Gly 

240 

AGA 768 
Arg 



AAT 816 
Asn 



ATG 864 
Met 



GCA 912 
Ala 



GCC 960 

Ala 

320 

GAG 1008 



Asp Pro Tyr Asp Gin Ser Phe Glu Ser Arg Asp Leu Leu lie Asp Glu 
325 330 335 

TGG AAA AGC CTG ACC TAT GAT GAA GTC ATC AGC TTT GTG CCA CCA CCC 1056 
Trp Lys Ser Leu Thr Tyr Asp Glu Val lie Ser Phe Val Pro Pro Pro 
340 345 350 

CTT GAC CAA GAA GAG ATG GAG TCC GAG GAT CCA CCG GTC GCC ACC ATG 1104 
Leu Asp Gin Glu Glu Met Glu Ser Glu Asp Pro Pro Val Ala Thr Met 
355 360 365 

GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG GTC 1152 
Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val 
370 375 380 

GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC GAG 12 00 
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
385 390 395 400 

GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC TGC 12 48 
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 
405 410 415 

ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC CTG 1296 
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 
420 425 430 

ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG CAG 1344 
Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
435 440 445 

CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG CGC 13 92 
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
450 455 460 

ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG GTG 1440 
Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
465 470 475 480 

AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC ATC 1488 
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 
485 490 495 

GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC AAC 1536 
Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
500 505 510 

TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC GGC 15 84 
Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn Gly 
515 520 525 

ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC GTG 16 32 
He Lys Val Asn Phe Lys lie Arg His Asn He Glu Asp Gly Ser Val 
530 535 540 

CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC CCC 16 80 

Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly Pro 
545 550 555 560 



GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG AGC 172 8 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
565 570 575 

AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC GTG 1776 

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
580 585 590 

ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TAA 1821 

Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
595 600 605 



(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 606 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

Met Ser Gin Glu Arg Pro Thr Phe Tyr Arg Gin Glu Leu Asn Lys Thr 

15 10 15 

lie Trp Glu Val Pro Glu Arg Tyr Gin Asn Leu Ser Pro Val Gly Ser 

20 25 30 

Gly Ala Tyr Gly Ser Val Cys Ala Ala Phe A.sp Thr Lys Thr Gly Leu 

35 40 45 

Arg Val Ala Val Lys Lys Leu Ser Arg Pro Phe Gin Ser lie He His 

50 55 60 

Ala Lys Arg Thr Tyr Arg Glu Leu Arg Leu Leu Lys His Met Lys His 
65 70 75 80 

Glu Asn Val He Gly Leu Leu Asp Val Phe Thr Pro Ala Arg Ser Leu 

85 90 95 

Glu Glu Phe Asn Asp Val Tyr Leu Val Thr His Leu Met Gly Ala Asp 

100 105 110 

Leu Asn Asn He Val Lys Cys Gin Lys Leu Thr Asp Asp His Val Gin 

115 120 125 

Phe Leu He Tyr Gin He Leu Arg Gly Leu Lys Tyr He His Ser Ala 

130 135 140 

Asp He He His Arg Asp Leu Lys Pro Ser Asn Leu Ala Val Asn Glu 
145 150 155 160 

Asp Cys Glu Leu Lys He Leu Asp Phe Gly Leu Ala Arg His Thr Asp 

165 170 175 

Asp Glu Met Thr Gly Tyr Val Ala Thr Arg Trp Tyr Arg Ala Pro Glu 

180 185 190 

He Met Leu Asn Trp Met His Tyr Asn Gin Thr Val Asp He Trp Ser 

195 200 205 

Val Gly Cys He Met Ala Glu Leu Leu Thr Gly Arg Thr Leu Phe Pro 

210 215 220 

Gly Thr Asp His He Asp Gin Leu Lys Leu He Leu Arg Leu Val Gly 
225 230 235 240 

Thr Pro Gly Ala Glu Leu Leu Lys Lys He Ser Ser Glu Ser Ala Arg 



245 250 255 

Asn Tyr lie Gin Ser Leu Thr Gin Met Pre Lys Met Asn Phe Ala Asn 

260 265 270 

Val Phe lie Gly Ala Asn Pro Leu Ala Val Asp Leu Leu Glu Lys Met 

275 280 285 

Leu Val Leu Asp Ser Asp Lys Arg lie Thr Ala Ala Gin Ala Leu Ala 

290 295 300 

His Ala Tyr Phe Ala Gin Tyr His Asp Pro Asp Asp Glu Pro Val Ala 
305 310 315 320 

Asp Pro Tyr Asp Gin Ser Phe Glu Ser Arg Asp Leu Leu He Asp Glu 

325 330 335 

Trp Lys Ser Leu Thr Tyr Asp Glu Val He Ser Phe Val Pro Pro Pro 

340 345 350 

Leu Asp Gin Glu Glu Met Glu Ser Glu Asp Pro Pro Val Ala Thr Met 

355 360 365 

Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val 

370 375 380 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
385 390 395 400 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 

405 410 415 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 

420 425 430 

Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 

435 440 445 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 

450 455 460 

Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
465 470 475 480 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 

485 490 495 

Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 

500 505 510 

Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 

515 520 525 

He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 

530 535 540 

Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 
545 550 555 560 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 

565 570 575 

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 

580 585 590 

Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
595 600 605 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2913 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE : 



(A) NAME /KEY : Coding Sequence 
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(B) LOCATION: 1. . .2910 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

ATG AGT GCT GAG GGG TAC CAG TAC AGA GCG CTG TAT GAT TAT AAA AAG 4 8 

Met Ser Ala Glu Gly Tyr Gin Tyr Arg Ala Leu Tyr Asp Tyr Lys Lys 
15 10 15 

GAA AGA GAA GAA GAT ATT GAC TTG CAC TTG GGT GAC ATA TTG ACT GTG 96 
Glu Arg Glu Glu Asp lie Asp Leu His Leu Gly Asp lie Leu Thr Val 
20 25 30 

AAT AAA GGG TCC TTA GTA GCT CTT GGA TTC AGT GAT GGA CAG GAA GCC 144 
Asn Lys Gly Ser Leu Val Ala Leu Gly Phe Ser Asp Gly Gin Glu Ala 
35 40 45 

AGG CCT GAA GAA ATT GGC TGG TTA AAT GGC TAT AAT GAA ACC ACA GGG 192 
Arg Pro Glu Glu He Gly Trp Leu Asn Gly Tyr Asn Glu Thr Thr Gly 
50 55 60 

GAA AGG GGG GAC TTT CCG GGA ACT TAC GTA GAA TAT ATT GGA AGG AAA 240 
Glu Arg Gly Asp Phe Pro Gly Thr Tyr Val Glu Tyr He Gly Arg Lys 
65 70 75 80 

AAA ATC TCG CCT CCC ACA CCA AAG CCC CGG CCA CCT CGG CCT CTT CCT 288 
Lys He Ser Pro Pro Thr Pro Lys Pro Arg Pro Pro Arg Pro Leu Pro 
85 90 95 

GTT GCA CCA GGT TCT TCG AAA ACT GAA GCA GAT GTT GAA CAA CAA GCT 336 
Val Ala Pro Gly Ser Ser Lys Thr Glu Ala Asp Val Glu Gin Gin Ala 
100 105 110 

TTG ACT CTC CCG GAT CTT GCA GAG CAG TTT GCC CCT CCT GAC ATT GCC 3 84 

Leu Thr Leu Pro Asp Leu Ala Glu Gin Phe Ala Pro Pro Asp He Ala 
115 120 125 

CCG CCT CTT CTT ATC AAG CTC GTG GAA GCC ATT GAA AAG AAA GGT CTG 432 
Pro Pro Leu Leu He Lys Leu Val Glu Ala He Glu Lys Lys Gly Leu 
130 135 140 

GAA TGT TCA ACT CTA TAC AGA ACA CAG AGO TCC AGC AAC CTG GCA GAA 4 80 

Glu Cys Ser Thr Leu Tyr Arg Thr Gin Ser Ser Ser Asn Leu Ala Glu 
145 150 155 160 

TTA CGA CAG CTT CTT GAT TGT GAT ACA CCC TCC GTG GAC TTG GAA ATG 52 8 

Leu Arg Gin Leu Leu Asp Cys Asp Thr Pro Ser Val Asp Leu Glu Met 
165 170 175 

ATC GAT GTG CAC GTT TTG GCT GAC GCT TTC AAA CGC TAT CTC CTG GAC 576 
lie Asp Val His Val Leu Ala Asp Ala Phe Lys Arg Tyr Leu Leu Asp 
180 185 190 

TTA CCA AAT CCT GTC ATT CCA GCA GCC GTT TAC AGT GAA ATG ATT TCT 624 
Leu Pro Asn Pro Val He Pro Ala Ala Val Tyr Ser Glu Met He Ser 
195 200 205 

TTA GCT CCA GAA GTA CAA AGC TCC GAA GAA TAT ATT CAG CTA TTG AAG 672 
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Leu Ala Pro Glu Val Gin Ser Ser Glu Glu Tyr lie Gin Leu Leu Lys 
210 215 220 

AAG CTT ATT AGG TCG CCT AGC ATA CCT CAT CAG TAT TGG CTT ACG CTT 720 
Lys Leu lie Arg Ser Pro Ser lie Pro His Gin Tyr Trp Leu Thr Leu 
225 230 235 240 

CAG TAT TTG TTA AAA CAT TTC TTC AAG CTC TCT CAA ACC TCC AGC AAA 768 
Gin Tyr Leu Leu Lys His Phe Phe Lys Leu Ser Gin Thr Ser Ser Lys 
245 250 255 

AAT CTG TTG AAT GCA AGA GTA CTC TCT GAA ATT TTC AGC CCT ATG CTT 816 
Asn Leu Leu Asn Ala Arg Val Leu Ser Glu He Phe Ser Pro Met Leu 
260 265 270 

TTC AGA TTC TCA GCA GCC AGC TCT GAT AAT ACT GAA AAC CTC ATA AAA 864 
Phe Arg Phe Ser Ala Ala Ser Ser Asp Asn Thr Glu Asn Leu He Lys 
275 280 285 

GTT ATA GAA ATT TTA ATC TCA ACT GAA TGG AAT GAA CGA CAG CCT GCA 912 
Val He Glu He Leu He Ser Thr Glu Trp Asn Glu Arg Gin Pro Ala 
290 295 300 

CCA GCA CTG CCT CCT AAA CCA CCA AAA CCT ACT ACT GTA GCC AAC AAC 960 
Pro Ala Leu Pro Pro Lys Pro Pro Lys Pro Thr Thr Val Ala Asn Asn 
305 310 315 320 

GGT ATG AAT AAC AAT ATG TCC TTA CAA AAT GCT GAA TGG TAC TGG GGA 1008 
Gly Met Asn Asn Asn Met Ser Leu Gin Asn Ala Glu Trp Tyr Trp Gly 
325 330 335 

GAT ATC TCG AGG GAA GAA GTG AAT GAA AAA CTT CGA GAT ACA GCA GAC 1056 
Asp He Ser Arg Glu Glu Val Asn Glu Lys Leu Arg Asp Thr Ala Asp 
340 345 350 

GGG ACC TTT TTG GTA CGA GAT GCG TCT ACT AAA ATG CAT GGT GAT TAT 1104 
Gly Thr Phe Leu Val Arg Asp Ala Ser Thr Lys Met His Gly Asp Tyr 
355 360 365 

ACT CTT ACA CTA AGG AAA GGG GGA AAT AAC AAA TTA ATC AAA ATA TTT 1152 
Thr Leu Thr Leu Arg Lys Gly Gly Asn Asn Lys Leu He Lys He Phe 
370 375 380 

CAT CGA GAT GGG AAA TAT GGC TTC TCT GAC CCA TTA ACC TTC AGT TCT 1200 
His Arg Asp Gly Lys Tyr Gly Phe Ser Asp Pro Leu Thr Phe Ser Ser 
385 390 395 400 

GTG GTT GAA TTA ATA AAC CAC TAC CGG AAT GAA TCT CTA GCT CAG TAT 12 48 
Val Val Glu Leu He Asn His Tyr Arg Asn Glu Ser Leu Ala Gin Tyr 
405 410 415 

AAT CCC AAA TTG GAT GTG AAA TTA CTT TAT CCA GTA TCC AAA TAC CAA 12 96 

Asn Pro Lys Leu Asp Val Lys Leu Leu Tyr Pro Val Ser Lys Tyr Gin 
420 425 430 

CAG GAT CAA GTT GTC AAA GAA GAT AAT ATT GAA GCT GTA GGG AAA AAA 13 44 

Gin Asp Gin Val Val Lys Glu Asp Asn He Glu Ala Val Gly Lys Lys 
435 440 445 



TTA CAT GAA TAT AAC ACT CAG TTT CAA GAA AAA AGT CGA GAA TAT GAT 13 92 
Leu His Glu Tyr Asn Thr Gin Phe Gin Glu Lys Ser Arg Glu Tyr Asp 
450 455 460 

AGA TTA TAT GAA GAA TAT ACC CGC ACA TCC CAG GAA ATC CAA ATG AAA 1440 
Arg Leu Tyr Glu Glu Tyr Thr Arg Thr Ser Gin Glu lie Gin Met Lys 
465 470 475 480 

AGG ACA GCT ATT GAA GCA TTT AAT GAA ACC ATA AAA ATA TTT GAA GAA 14 88 
Arg Thr Ala lie Glu Ala Phe Asn Glu Thr He Lys He Phe Glu Glu 
485 490 495 

CAG TGC CAG ACC CAA GAG CGG TAC AGC AAA GAA TAC ATA GAA AAG TTT 1536 
Gin Cys Gin Thr Gin Glu Arg Tyr Ser Lys Glu Tyr He Glu Lys Phe 
500 505 510 

AAA CGT GAA GGC AAT GAG AAA GAA ATA CAA AGG ATT ATG CAT AAT TAT 1584 
Lys Arg Glu Gly Asn Glu Lys Glu He Gin Arg He Met His Asn Tyr 
515 520 525 

GAT AAG TTG AAG TCT CGA ATC AGT GAA ATT ATT GAC AGT AGA AGA AGA 1632 
Asp Lys Leu Lys Ser Arg He Ser Glu He He Asp Ser Arg Arg Arg 
530 535 540 

TTG GAA GAA GAC TTG AAG AAG CAG GCA GCT GAG TAT CGA GAA ATT GAC 1680 
Leu Glu Glu Asp Leu Lys Lys Gin Ala Ala Glu Tyr Arg Glu lie Asp 
545 550 555 560 

AAA CGT ATG AAC AGC ATT AAA CCA GAC CTT ATC CAG CTG AGA AAG ACG 1728 
Lys Arg Met Asn Ser lie Lys Pro Asp Leu He Gin Leu Arg Lys Thr 
565 570 575 

AGA GAC CAA TAC TTG ATG TGG TTG ACT CAA AAA GGT GTT CGG CAA AAG 1776 
Arg Asp Gin Tyr Leu Met Trp Leu Thr Gin Lys Gly Val Arg Gin Lys 
580 585 590 

AAG TTG AAC GAG TGG TTG GGC AAT GAA AAC ACT GAA GAC CAA TAT TCA 1824 
Lys Leu Asn Glu Trp Leu Gly Asn Glu Asn Thr Glu Asp Gin Tyr Ser 
595 600 605 

CTG GTG GAA GAT GAT GAA GAT TTG CCC CAT CAT GAT GAG AAG ACA TGG 1872 
Leu Val Glu Asp Asp Glu Asp Leu Pro His His Asp Glu Lys Thr Trp 
610 615 620 

AAT GTT GGA AGC AGC AAC CGA AAC AAA GCT GAA AAC CTG TTG CGA GGG 1920 
Asn Val Gly Ser Ser Asn Arg Asn Lys Ala Glu Asn Leu Leu Arg Gly 
625 630 635 640 

AAG CGA GAT GGC ACT TTT CTT GTC CGG GAG AGC AGT AAA CAG GGC TGC 1968 
Lys Arg Asp Gly Thr Phe Leu Val Arg Glu Ser Ser Lys Gin Gly Cys 
645 650 655 

TAT GCC TGC TCT GTA GTG GTG GAC GGC GAA GTA AAG CAT TGT GTC ATA 2016 
Tyr Ala Cys Ser Val Val Val Asp Gly Glu Val Lys His Cys Val He 
660 665 670 

AAC AAA ACA GCA ACT GGC TAT GGC TTT GCC GAG CCC TAT AAC TTG TAC 2064 
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Asn Lys Thr Ala Thr Gly Tyr Gly Phe Ala Glu Pro Tyr Asn Leu Tyr 
675 680 685 

AGC TCT CTG AAA GAA CTG GTG CTA CAT TAC CAA CAC ACC TCC CTT GTG 2112 
Ser Ser Leu Lys Glu Leu Val Leu His Tyr Gin His Thr Ser Leu Val 
690 695 700 

CAG CAC AAC GAC TCC CTC AAT GTC ACA CTA GCC TAC CCA GTA TAT GCA 2160 
Gin His Asn Asp Ser Leu Asn Val Thr Leu Ala Tyr Pro Val Tyr Ala 
705 710 715 720 

CAG CAG AGG CGA CAG GAT CCA CCG GTC GCC ACC ATG GTG AGC AAG GGC 22 08 
Gin Gin Arg Arg Gin Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly 
725 730 735 

GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG GTC GAG CTG GAC GGC 22 56 
Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val Glu Leu Asp Gly 
740 745 750 

GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC GAG GGC GAG GGC GAT 23 04 
Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp 
755 760 765 

GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC TGC ACC ACC GGC AAG 2 3 52 
Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys Thr Thr Gly Lys 
770 775 780 

CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC CTG ACC TAC GGC GTG 24 00 
Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val 
785 790 795 800 

CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG CAG CAC GAC TTC TTC 24 4 8 
Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe 
805 810 815 

AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC 2496 
Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr lie Phe Phe 
820 825 830 

AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG GTG AAG TTC GAG GGC 2 544 

Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly 
835 840 845 

GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC ATC GAC TTC AAG GAG 25 92 
Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie Asp Phe Lys Glu 
850 855 860 

GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC AAC TAC AAC AGC CAC 2 640 
Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His 
865 870 875 880 

AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC GGC ATC AAG GTG AAC 2 6 88 

Asn Val Tyr lie Met Ala Asp Lys Gin Lys A.sn Gly lie Lys Val Asn 
885 890 895 

TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC GTG CAG CTC GCC GAC 27 3 6 

Phe Lys lie Arg His Asn lie Glu Asp Gly Ser Val Gin Leu Ala Asp 
900 905 910 



CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC CCC GTG CTG CTG CCC 27 84 
His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly Pro Val Leu Leu Pro 
915 920 925 

GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG AGC AAA GAC CCC AAC 2 832 
Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn 
930 935 940 

GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC GTG ACC GCC GCC GGG 2880 
Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly 
945 950 955 960 

ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TAA 2913 
lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
965 970 



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 970 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67: 



Met Ser Ala Glu Gly Tyr Gin Tyr Arg Ala Leu Tyr Asp Tyr Lys Lys 

15 10 15 

Glu Arg Glu Glu Asp lie Asp Leu His Leu Gly Asp lie Leu Thr Val 

20 25 30 

Asn Lys Gly Ser Leu Val Ala Leu Gly Phe Ser Asp Gly Gin Glu Ala 

35 40 45 

Arg Pro Glu Glu He Gly Trp Leu Asn Gly Tyr Asn Glu Thr Thr Gly 

50 55 60 

Glu Arg Gly Asp Phe Pro Gly Thr Tyr Val Glu Tyr He Gly Arg Lys 
65 70 75 80 

Lys He Ser Pro Pro Thr Pro Lys Pro Arg Pro Pro Arg Pro Leu Pro 

85 90 95 

Val Ala Pro Gly Ser Ser Lys Thr Glu Ala Asp Val Glu Gin Gin Ala 

100 105 HO 

Leu Thr Leu Pro Asp Leu Ala Glu Gin Phe Ala Pro Pro Asp He Ala 

115 120 125 

Pro Pro Leu Leu He Lys Leu Val Glu Ala He Glu Lys Lys Gly Leu 

130 135 140 

Glu Cys Ser Thr Leu Tyr Arg Thr Gin Ser Ser Ser Asn Leu Ala Glu 
145 150 155 160 

Leu Arg Gin Leu Leu Asp Cys Asp Thr Pro Ser Val Asp Leu Glu Met 

165 170 175 

He Asp Val His Val Leu Ala Asp Ala Phe Lys Arg Tyr Leu Leu Asp 

180 185 190 

Leu Pro Asn Pro Val He Pro Ala Ala Val Tyr Ser Glu Met He Ser 

195 200 205 

Leu Ala Pro Glu Val Gin Ser Ser Glu Glu Tyr He Gin Leu Leu Lys 



210 215 220 

Lys Leu He Arg Ser Pro Ser lie Pro His Gin Tyr Trp Leu Thr Leu 
225 230 235 240 

Gin Tyr Leu Leu Lys His Phe Phe Lys Leu Ser Gin Thr Ser Ser Lys 

245 250 255 

Asn Leu Leu Asn Ala Arg Val Leu Ser Glu He Phe Ser Pro Met Leu 

260 265 270 

Phe Arg Phe Ser Ala Ala Ser Ser Asp Asn Thr Glu Asn Leu He Lys 

275 280 285 

Val He Glu He Leu He Ser Thr Glu Trp Asn Glu Arg Gin Pro Ala 

290 295 300 

Pro Ala Leu Pro Pro Lys Pro Pro Lys Pro Thr Thr Val Ala Asn Asn 
305 310 315 320 

Gly Met Asn Asn Asn Met Ser Leu Gin Asn Ala Glu Trp Tyr Trp Gly 

325 330 335 

Asp He Ser Arg Glu Glu Val Asn Glu Lys Leu Arg Asp Thr Ala Asp 

340 345 350 

Gly Thr Phe Leu Val Arg Asp Ala Ser Thr Lys Met His Gly Asp Tyr 

355 360 365 

Thr Leu Thr Leu Arg Lys Gly Gly Asn Asn Lys Leu He Lys He Phe 

370 375 380 

His Arg Asp Gly Lys Tyr Gly Phe Ser Asp Pro Leu Thr Phe Ser Ser 
385 390 395 400 

Val Val Glu Leu He Asn His Tyr Arg Asn Glu Ser Leu Ala Gin Tyr 

405 410 415 

Asn Pro Lys Leu Asp Val Lys Leu Leu Tyr Pro Val Ser Lys Tyr Gin 

420 425 430 

Gin Asp Gin Val Val Lys Glu Asp Asn He Glu Ala Val Gly Lys Lys 

435 440 445 

Leu His Glu Tyr Asn Thr Gin Phe Gin Glu Lys Ser Arg Glu Tyr Asp 

450 455 460 

Arg Leu Tyr Glu Glu Tyr Thr Arg Thr Ser Gin Glu He Gin Met Lys 
465 470 475 480 

Arg Thr Ala He Glu Ala Phe Asn Glu Thr He Lys He Phe Glu Glu 

485 490 495 

Gin Cys Gin Thr Gin Glu Arg Tyr Ser Lys Glu Tyr He Glu Lys Phe 

500 505 510 

Lys Arg Glu Gly Asn Glu Lys Glu lie Gin Arg He Met His Asn Tyr 

515 520 525 

Asp Lys Leu Lys Ser Arg He Ser Glu He lie Asp Ser Arg Arg Arg 

530 535 540 

Leu Glu Glu Asp Leu Lys Lys Gin Ala Ala Glu Tyr Arg Glu He Asp 
545 550 555 560 

Lys Arg Met Asn Ser He Lys Pro Asp Leu He Gin Leu Arg Lys Thr 

565 570 575 

Arg Asp Gin Tyr Leu Met Trp Leu Thr Gin Lys Gly Val Arg Gin Lys 

580 585 590 

Lys Leu Asn Glu Trp Leu Gly Asn Glu Asn Thr Glu Asp Gin Tyr Ser 

595 600 605 

Leu Val Glu Asp Asp Glu Asp Leu Pro His His Asp Glu Lys Thr Trp 

610 615 620 

Asn Val Gly Ser Ser Asn Arg Asn Lys Ala Glu Asn Leu Leu Arg Gly 
625 630 635 640 

Lys Arg Asp Gly Thr Phe Leu Val Arg Glu Ser Ser Lys Gin Gly Cys 

645 650 655 

Tyr Ala Cys Ser Val Val Val Asp Gly Glu Val Lys His Cys Val He 

660 665 670 

Asn Lys Thr Ala Thr Gly Tyr Gly Phe Ala Glu Pro Tyr Asn Leu Tyr 
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675 680 685 

Ser Ser Leu Lys Glu Leu Val Leu His Tyr Gin His Thr Ser Leu Val 

690 695 700 

Gin His Asn Asp Ser Leu Asn Val Thr Leu Ala Tyr Pro Val Tyr Ala 
705 710 715 720 

Gin Gin Arg Arg Gin Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly 

725 730 735 

Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu Leu Asp Gly 

740 745 750 

Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp 

755 760 765 

Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys 

770 775 780 

Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val 
785 790 795 800 

Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe 

805 810 815 

Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe 

820 825 830 

Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly 

835 840 845 

Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly lie Asp Phe Lys Glu 

850 855 860 

Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His 
865 870 875 880 

Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn 

885 890 895 

Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp 

900 905 910 

His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro 

915 920 925 

Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn 

930 935 940 

Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly 
945 950 955 960 

He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
965 970 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1788 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...1785 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 



ATG GGC AAC GCC GCC GCC GCC A^G AAG GGC AGC GAG CAG GAG AGC GTG 
Met Gly Asn Ala Ala Ala Ala Lys Lys Gly Ser Glu Gin Glu Ser Val 
15 10 15 



AAA GAG TTC CTA GCC AAA GCC AAG GAA GAT TTC CTG AAA AAA TGG GAA 96 
Lys Glu Phe Leu Ala Lys Ala Lys Glu Asp Phe Leu Lys Lys Trp Glu 
20 25 30 

GAC CCC TCT CAG AAT ACA GCC CAG TTG GAT CAG TTT GAT AGA ATC AAG 144 
Asp Pro Ser Gin Asn Thr Ala Gin Leu Asp Gin Phe Asp Axg lie Lys 
35 40 45 

ACC CTT GGC ACC GGC TCC TTT GGG CGA GTG ATG CTG GTG AAG CAC AAG 192 
Thr Leu Gly Thr Gly Ser Phe Gly Arg Val Met Leu Val Lys His Lys 
50 55 60 

GAG AGT GGG AAC CAC TAC GCC ATG AAG ATC TTA GAC AAG CAG AAG GTG 240 
Glu Ser Gly Asn His Tyr Ala Met Lys lie Leu Asp Lys Gin Lys Val 
65 70 75 80 

GTG AAG CTA AAG CAG ATC GAG CAC ACT CTG AAT GAG AAG CGC ATC CTG 2 88 

Val Lys Leu Lys Gin lie Glu His Thr Leu Asn Glu Lys Arg lie Leu 
85 90 95 

CAG GCC GTC AAC TTC CCG TTC CTG GTC AAA CTT GAA TTC TCC TTC AAG 336 
Gin Ala Val Asn Phe Pro Phe Leu Val Lys Leu Glu Phe Ser Phe Lys 
100 105 110 

GAC AAC TCA AAC CTG TAC ATG GTC ATG GAG TAT GTA GCT GGT GGC GAG 384 
Asp Asn Ser Asn Leu Tyr Met Val Met Glu Tyr Val Ala Gly Gly Glu 
115 120 125 

ATG TTC TCC CAC CTA CGG CGG ATT GGA AGG TTC AGC GAG CCC CAT GCC 432 
Met Phe Ser His Leu Arg Arg lie Gly Arg Phe Ser Glu Pro His Ala 
130 135 140 

CGT TTC TAC GCG GCG CAG ATC GTC CTG ACC TTT GAG TAT CTG CAC TCC 4 80 

Arg Phe Tyr Ala Ala Gin He Val Leu Thr Phe Glu Tyr Leu His Ser 
145 150 155 160 

CTG GAC CTC ATC TAC CGG GAC CTG AAG CCC GAG AAT CTT CTC ATC GAC 528 
Leu Asp Leu He Tyr Arg Asp Leu Lys Pro Glu Asn Leu Leu He Asp 
165 170 175 

CAG CAG GGC TAT ATT CAG GTG ACA GAC TTC GGT TTT GCC AAG CGT GTG 576 
Gin Gin Gly Tyr He Gin Val Thr Asp Phe Gly Phe Ala Lys Arg Val 
180 185 190 

AAA GGC CGT ACT TGG ACC TTG TGT G3G ACC CCT GAG TAC TTG GCC CCC 624 
Lys Gly Arg Thr Trp Thr Leu Cys Gly Thr Pro Glu Tyr Leu Ala Pro 
195 200 205 

GAG ATT ATC CTG AOZ AAA GGC TAC AAC AAG GCT GTG GAC TGG TGG GCT 672 
Glu He lie Leu Ser Lys Gly Tyr Asn Lys Ala Val Asp Trp Trp Ala 
210 215 220 

CTC GGA GTC CTC ATC TAC GAG ATG G*CT GCT GGT TAC CCA CCC TTC TTC 7 2C 

Leu Gly Val Leu He Tyr Glu Met Ala Ala Gly Tyr Pro Pro Phe Phe 
225 230 235 240 

GCT GAC CAG CCT ATC CAG ATC TAT GAG AAA ATC GTC TCT GGG AAG GTG 7 68 



Ala Asp Gin Pro He Gin He Tyr Glu Lys He Val Ser Gly Lys Val 
245 250 255 

CGG TTC CCA TCC CAC TTC AGC TCT GAC TTG AAG GAC CTG CTG CGG AAC 816 
Arg Phe Pro Ser His Phe Ser Ser Asp Leu Lys Asp Leu Leu Arg Asn 
260 265 270 

CTT CTG CAA GTG GAT CTA ACC AAG CGC TTT GGA AAC CTC AAG GAC GGG 864 
Leu Leu Gin Val Asp Leu Thr Lys Arg Phe Gly Asn Leu Lys Asp Gly 
275 280 285 

GTC AAT GAC ATC AAG AAC CAC AAG TGG TTT GCC ACG ACT GAC TGG ATT 912 
Val Asn Asp He Lys Asn His Lys Trp Phe Ala Thr Thr Asp Trp He 
290 295 300 

GCC ATC TAT CAG AGA AAG GTG GAA GCT CCC TTC ATA CCA AAG TTT AAA 960 
Ala He Tyr Gin Arg Lys Val Glu Ala Pro Phe He Pro Lys Phe Lys 
305 310 315 320 

GGC CCT GGG GAC ACG AGT AAC TTT GAC GAC TAT GAG GAG GAA GAG ATC 1008 
Gly Pro Gly Asp Thr Ser Asn Phe Asp Asp Tyr Glu Glu Glu Glu He 
325 330 335 

CGG GTC TCC ATC AAT GAG AAG TGT GGC AAG GAG TTT ACT GAG TTT GGG 1056 
Arg Val Ser He Asn Glu Lys Cys Gly Lys Glu Phe Thr Glu Phe Gly 
340 345 350 

CGC GCC ATG AGT AAA GGA GAA GAA CTT TTC ACT GGA GTT GTC CCA ATT 1104 
Arg Ala Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He 
355 360 365 

CTT GTT GAA TTA GAT GGC GAT GTT AAT GGG CAA AAA TTC TCT GTT AGT 1152 
Leu Val Glu Leu Asp Gly Asp Val Asn Gly Gin Lys Phe Ser Val Ser 
370 375 380 

GGA GAG GGT GAA GGT GAT GCA ACA TAC GGA AAA CTT ACC CTT AAA TTT 12 00 
Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe 
385 390 395 400 

ATT TGC ACT ACT GGG AAG CTA CCT GTT CCA TGG CCA ACG CTT GTC ACT 1248 
He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr 
405 410 415 

ACT CTC ACT TAT GGT GTT CAA TGC TTT TCT AGA TAC CCA GAT CAT ATG 12 96 

Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met 
420 425 430 

AAA CAG CAT GAC TTT TTC AAG AGT GCC ATG CCC GAA GGT TAT GTA CAG 13 44 
Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin 
435 440 445 

GAA AGA ACT ATA TTT TAC AAA GAT GAC GGG AAC TAC AAG ACA CGT GCT 13 92 

Glu Arg Thr He Phe Tyr Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala 
450 455 460 

GAA GTC AAG TTT GAA G3T GAT ACC CTT GTT AAT AGA ATC GAG TTA AAA 144 0 

Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys 
465 470 475 480 
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GGT ATT GAT TTT AAA GAA GAT GGA AAC ATT CTT GGA CAC AAA ATG GAA 14 88 

Gly lie Asp Fhe Lys Glu Asp Gly Asn lie Leu Gly His Lys Met Glu 
485 490 495 

TAC AAT TAT AAC TCA CAT AAT GTA TAC ATC ATG GCA GAC AAA CCA AAG 1536 
Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Pro Lys 
500 505 510 

AAT GGC ATC AAA GTT AAC TTC AAA ATT AGA CAC AAC ATT AAA GAT GGA 1584 
Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Lys Asp Gly 
515 520 525 

AGC GTT CAA TTA GCA GAC CAT TAT CAA CAA AAT ACT CCA ATT GGC GAT 1632 
Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp 
530 535 540 

GGC CCT GTC CTT TTA CCA GAC AAC CAT TAC CTG TCC ACG CAA TCT GCC 1680 
Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala 
545 550 555 560 

CTT TCC AAA GAT CCC AAC GAA AAG AGA GAT CAC ATG ATC CTT CTT GAG 1728 
Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met He Leu Leu Glu 
565 570 575 

TTT GTA ACA GCT GCT GGG ATT ACA CAT GGC ATG GAT GAA CTA TAC AAA 1776 
Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys 
580 585 590 

CCT CAG GAG TAA 1788 
Pro Gin Glu 
595 



(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 59 5 amino acids 
<B) TYPE: amino acid 

(C) STRAND EDNES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

Met Gly Asn Ala Ala Ala Ala Lys Lys Gly Ser Glu Gin Glu Ser Val 

15 10 15 

Lys Glu Phe Leu Ala Lys Ala Lys Glu Asp Phe Leu Lys Lys Trp Glu 

20 25 30 

Asp Pro Ser Gin Asn Thr Ala Gin Leu Asp Gin Phe Asp Arg He Lys 

35 40 45 

Thr Leu Gly Thr Gly Ser Phe Gly Arg Val Met Leu Val Lys His Lys 

50 55 60 

Glu Ser Gly Asn His Tyr Ala Met Lys He Leu Asp Lys Gin Lys Val 
65 70 75 80 

Val Lys Leu Lys Gin He Glu His Thr Leu Asn Glu Lys Arg He Leu 



85 90 95 

Gin Ala Val Asn Phe Pro Phe Leu Val Lys Leu Glu Phe Ser Phe Lys 

100 105 110 

Asp Asn Ser Asn Leu Tyr Met Val Met Glu Tyr Val Ala Gly Gly Glu 

115 120 125 

Met Phe Ser His Leu Arg Arg lie Gly Arg Phe Ser Glu Pro His Ala 

130 135 140 

Arg Phe Tyr Ala Ala Gin lie Val Leu Thr Phe Glu Tyr Leu His Ser 
145 150 155 160 

Leu Asp Leu lie Tyr Arg Asp Leu Lys Pro Glu Asn Leu Leu lie Asp 

165 170 175 

Gin Gin Gly Tyr lie Gin Val Thr Asp Phe Gly Phe Ala Lys Arg Val 

180 185 190 

Lys Gly Arg Thr Trp Thr Leu Cys Gly Thr Pro Glu Tyr Leu Ala Pro 

195 200 205 

Glu He He Leu Ser Lys Gly Tyr Asn Lys Ala Val Asp Trp Trp Ala 

210 215 220 

Leu Gly Val Leu He Tyr Glu Met Ala Ala Gly Tyr Pro Pro Phe Phe 
225 230 235 240 

Ala Asp Gin Pro He Gin He Tyr Glu Lys He Val Ser Gly Lys Val 

245 250 255 

Arg Phe Pro Ser His Phe Ser Ser Asp Leu Lys Asp Leu Leu Arg Asn 

260 265 270 

Leu Leu Gin Val Asp Leu Thr Lys Arg Phe Gly Asn Leu Lys Asp Gly 

275 280 285 

Val Asn Asp He Lys Asn His Lys Trp Phe Ala Thr Thr Asp Trp He 

290 295 300 

Ala He Tyr Gin Arg Lys Val Glu Ala Pro Phe He Pro Lys Phe Lys 
305 310 315 320 

Gly Pro Gly Asp Thr Ser Asn Phe Asp Asp Tyr Glu Glu Glu Glu He 

325 330 335 

Arg Val Ser He Asn Glu Lys Cys Gly Lys Glu Phe Thr Glu Phe Gly 

340 345 350 

Arg Ala Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He 

355 360 365 

Leu Val Glu Leu Asp Gly Asp Val Asn Gly Gin Lys Phe Ser Val Ser 

370 375 380 

Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe 
385 390 395 400 

He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr 

405 410 415 

Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met 

420 425 430 

Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin 

435 440 445 

Glu Arg Thr He Phe Tyr Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala 

450 455 460 

Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys 
465 470 475 480 

Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Met Glu 

485 490 495 

Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Pro Lys 

500 505 510 

Asn Gly lie Lys Val Asn Phe Lys He Arg His Asn He Lys Asp Gly 

515 520 525 

Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp 

530 535 540 

Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala 



545 550 555 560 

Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met lie Leu Leu Glu 

565 570 575 

Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys 
580 585 590 

Pro Gin Glu 
595 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2181 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingl e 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME /KEY: Coding Sequence 

(B) LOCATION: 1...2178 
<D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

ATG AGC GAC GTG GCT ATT GTG AAG GAG GGT TGG CTG CAC AAA CGA GGG 48 
Met Ser Asp Val Ala He Val Lys Glu Gly Trp Leu His Lys Arg Gly 
15 10 15 

GAG TAC ATC AAG ACC TGG CGG CCA CGC TAC TTC CTC CTC AAG AAT GAT 96 
Glu Tyr He Lys Thr Trp Arg Pro Arg Tyr Phe Leu Leu Lys Asn Asp 
20 25 30 

GGC ACC TTC ATT GGC TAC AAG GAG CGG CCG CAG GAT GTG GAC CAA CGT 144 
Gly Thr Phe He Gly Tyr Lys Glu Arg Pro Gin Asp Val Asp Gin Arg 
35 40 45 

GAG GCT CCC CTC AAC AAC TTC TCT GTG GCG CAG TGC CAG CTG ATG AAG 192 
Glu Ala Pro Leu Asn Asn Phe Ser Val Ala Gin Cys Gin Leu Met Lys 
50 55 60 

ACG GAG CGG CCC CGG CCC AAC ACC TTC ATC ATC CGC TGC CTG CAG TGG 240 
Thr Glu Arg Pro Arg Pro Asn Thr Phe He He Arg Cys Leu Gin Trp 
65 70 75 80 

ACC ACT GTC ATC GAA CGC ACC TTC CAT GTG GAG ACT CCT GAG GAG CGG 2 88 

Thr Thr Val He Glu Arg Thr Phe His Val Glu Thr Pro Glu Glu Arg 
85 90 95 

GAG GAG TGG ACA ACC GCC ATC CAG ACT GTG GCT GAC GGC CTC AAG AAG 336 
Glu Glu Trp Thr Thr Ala He Gin Thr Val Ala Asp Gly Leu Lys Lys 
100 105 110 

CAG GAG GAG GAG GAG ATG GAC TTC C3G TCG GGC TCA CCC AGT GAC AAC 3 84 

Gin Glu Glu Glu Glu Met Asp Phe Arg Ser Gly Ser Pro Ser Asp Asn 
115 120 125 

TCA GGG GCT GAA GAG ATG GAG GTG TZC CTG GCC AAG CCC AAG CAC CGC 4 32 



Ser Gly Ala Glu Glu Met Glu Val Ser Leu Ala Lys Pro Lys His Arg 
130 135 140 

GTG ACC ATG AAC GAG TTT GAG TAC CTG AAG CTG CTG GGC AAG GGC AGT 4 80 

Val Thr Met Asn Glu Phe Glu Tyr Leu Lys Leu Leu Gly Lys Gly Thr 
145 150 155 160 

TTC GO: AAG GTG ATC CTG GTG AAG GAG AAG GCC ACA GGC CGC TAC TAC 5 28 

Phe Gly Lys Val lie Leu Val Lys Glu Lys Ala Thr Gly Arg Tyr Tyr 
165 170 175 

GCC ATG AAG ATC CTC AAG AAG GAA GTC ATC GTG GCC AAG GAC GAG GTG 576 
Ala Met Lys He Leu Lys Lys Glu Val He Val Ala Lys Asp Glu Val 
180 185 190 

GCC CAC ACA CTC ACC GAG AAC CGC GTC CTG CAG AAC TCC AGG CAC CCC 624 
Ala His Thr Leu Thr Glu Asn Arg Val Leu Gin Asn Ser Arg His Pro 
195 200 205 

TTC CTC ACA GCC CTG AAG TAC TCT TTC CAG ACC CAC GAC CGC CTC TGC 67 2 

Phe Leu Thr Ala Leu Lys Tyr Ser Phe Gin Thr His Asp Arg Leu Cys 
210 215 220 

TTT GTC ATG GAG TAC GCC AAC GGG GGC GAG CTG TTC TTC CAC CTG TCC 7 20 

Phe Val Met Glu Tyr Ala Asn Gly Gly Glu Leu Phe Phe His Leu Ser 
225 230 235 240 

CGG GAA CGT GTG TTC TCC GAG GAC CGG GCC CGC TTC TAT GGC GCT GAG 7 68 

Arg Glu Arg Val Phe Ser Glu Asp Arg Ala Arg Phe Tyr Gly Ala Glu 
245 250 255 

ATT GTG TCA GCC CTG GAC TAC CTG CAC TCG GAG AAG AAC GTG GTG TAC 816 
He Val Ser Ala Leu Asp Tyr Leu His Ser Glu Lys Asn Val Val Tyr 
260 265 270 

CGG GAC CTC AAG CTG GAG AAC CTC ATG CTG GAC AAG GAC GGG CAC ATT 864 
Arg Asp Leu Lys Leu Glu Asn Leu Met Leu Asp Lys Asp Gly His He 
275 280 285 

AAG ATC ACA GAC TTC GGG CTG TGC AAG GAG GGG ATC AAG GAC GGT GCC 912 
Lys He Thr Asp Phe Gly Leu Cys Lys Glu Gly He Lys Asp Gly Ala 
290 295 300 

ACC ATG AAG ACC TTT TGC GGC ACA CCT GAG TAC CTG GCC CCC GAG GTG 960 
Thr Met Lys Thr Phe Cys Gly Thr Pro Glu Tyr Leu Ala Pro Glu Val 
305 310 315 320 

CTG GAG GAC AAT GAC TAC GGC CGT GCA GTG GAC TGG TGG GGG CTG GGC 1008 
Leu Glu Asp Asn Asp Tyr Gly Arg Ala Val Asp Trp Trp Gly Leu Gly 
325 330 335 

GTG GTC ATG TAC GAG ATG ATG TGC G3T CGC CTG CCC TTC TAC AAC CAG 1056 
Val Val Met Tyr Glu Met Met Cys Gly Arg Leu Pro Phe Tyr Asn Gin 
340 345 350 

GAC CAT GAG AAG CTT TTT GAG CTC ATC CTC ATG GAG GAG ATC CGC TTC 1104 
Asp His Glu Lys Leu Phe Glu Leu He Leu Met Glu Glu He Arg Phe 
355 360 365 



9?- 



CCG CGC ACG CTT GGT CCC GAG GCC AAG TCC TTG CTT TCA GGG CTG CTC 1152 
Pro Arg Thr Leu Gly Pro Glu Ala Lys Ser Leu Leu Ser Gly Leu Leu 
370 375 380 

AAG AAG GAC CCC AAG CAG AGG CTT GGC GGG GGC TCC GAG GAC GCC AAG 1200 
Lys Lys Asp Pro Lys Gin Arg Leu Gly Gly Gly Ser Glu Asp Ala Lys 
385 390 395 400 

GAG ATC ATG CAG CAT CGC TTC TTT GCC GGT ATC GTG TGG CAG CAC GTG 1248 
Glu lie Met Gin His Arg Phe Phe Ala Gly lie Val Trp Gin His Val 
405 410 415 

TAC GAG AAG AAG CTC AGC CCA CCC TTC AAG CCC CAG GTC ACG TCG GAG 1296 
Tyr Glu Lys Lys Leu Ser Pro Pro Phe Lys Pro Gin Val Thr Ser Glu 
420 425 430 

ACT GAC ACC AGG TAT TTT GAT GAG GAG TTC ACG GCC CAG ATG ATC ACC 1344 
Thr Asp Thr Arg Tyr Phe Asp Glu Glu Phe Thr Ala Gin Met lie Thr 
435 440 445 

ATC ACA CCA CCT GAC CAA GAT GAC AGC ATG GAG TGT GTG GAC AGC GAG 1392 
lie Thr Pro Pro Asp Gin Asp Asp Ser Met Glu Cys Val Asp Ser Glu 
450 455 460 

CGC AGG CCC CAC TTC CCC CAG TTC TCC TAC TCG GCC AGC AGC ACG GCC 1440 
Arg Arg Pro His Phe Pro Gin Phe Ser Tyr Ser Ala Ser Ser Thr Ala 
465 470 475 480 

TCG GAT CCA CCG GTC GCC ACC ATG GTG AGC AAG GGC GAG GAG CTG TTC 14 88 
Ser Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe 
485 490 495 

ACC GGG GTG GTG CCC ATC CTG GTC GAG CTG GAC GGC GAC GTA AAC GGC 1536 
Thr Gly Val Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly 
500 505 510 

CAC AAG TTC AGC GTG TCC GGC GAG GGC GAG GGC GAT GCC ACC TAC GGC 15S4 
His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly 
515 520 525 

AAG CTG ACC CTG AAG TTC ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC 1632 
Lys Leu Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro 
530 535 540 

TGG CCC ACC CTC GTG ACC ACC CTC ACC TAC GGC GTG CAG TGC TTC AGC 1680 
Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser 
545 550 555 560 

CGC TAC CCC GAC CAC ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG 172 8 
Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met 
565 570 575 

CCC GAA GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC 177 6 

Pro Glu Gly Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly 
580 585 590 

AAC TAC AAG ACC CGC GCC GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG 182 4 



Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val 
595 600 605 

AAC CGC ATC GAG CTG AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC 1872 
Asn Arg lie Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He 
610 615 620 

CTG GGG CAC AAG CTG GAG TAC AAC TAC AAC AG2 CAC AAC GTC TAT ATC 192 0 

Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He 
625 630 635 640 

ATG GCC GAC AAG CAG AAG AAC GGC ATC AAG GTC AAC TTC AAG ATC CGC 1968 
Met Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg 
645 650 655 

CAC AAC ATC GAG GAC GGC AGC GTG CAG CTC GCC GAC CAC TAC CAG CAG 2 016 
His Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin 
660 665 670 

AAC ACC CCC ATC GGC GAC GGC CCC GTG CTG CTG CCC GAC AAC CAC TAC 2064 
Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr 
675 680 685 

CTG AGC ACC CAG TCC GCC CTG AGC AAA GAC CCC AAC GAG AAG CGC GAT 2112 
Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp 
690 695 700 

CAC ATG GTC CTG CTG GAG TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC 2160 
His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly 

705 710 715 720 

ATG GAC GAG CTG TAC AAG TAA 2181 
Met Asp Glu Leu Tyr Lys 
725 



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Met Ser Asp Val Ala He Val Lys Glu Gly Trp Leu His Lys Arg Gly 

15 10 15 

Glu Tyr He Lys Thr Trp Arg Pro Arg Tyr Phe Leu Leu Lys Asn Asp 

20 25 30 

Gly Thr Phe He Gly Tyr Lys Glu Arg Pro Gin Asp Val Asp Gin Arg 

35 40 45 

Glu Ala Pro Leu Asn Asn Phe Ser Val Ala Gin Cys Gin Leu Met Lys 

50 55 60 

Thr Glu Arg Pro Arg Pro Asn Thr Phe He He Arg Cys Leu Gin Trp 



65 70 75 8C 

Thr Thr Val He Glu Arg Thr Phe His Val Glu Thr Pro Glu Glu Arg 

85 90 95 

Glu Glu Trp Thr Thr Ala He Gin Thr Val Ala Asp Gly Leu Lys Lys 

100 105 110 

Gin Glu Glu Glu Glu Met Asp Phe Arg Ser Gly Ser Pro Ser Asp Asn 

115 120 125 

Ser Gly Ala Glu Glu Met Glu Val Ser Leu Ala Lys Pro Lys His Arg 

130 135 140 

Val Thr Met Asn Glu Phe Glu Tyr Leu Lys Leu Leu Gly Lys Gly Thr 
145 150 155 160 

Phe Gly Lys Val He Leu Val Lys Glu Lys Ala Thr Gly Arg Tyr Tyr 

165 170 175 

Ala Met Lys He Leu Lys Lys Glu Val He Val Ala Lys Asp Glu Val 

180 185 190 

Ala His Thr Leu Thr Glu Asn Arg Val Leu Gin Asn Ser Arg His Pro 

195 200 205 

Phe Leu Thr Ala Leu Lys Tyr Ser Phe Gin Thr His Asp Arg Leu Cys 

210 215 220 

Phe Val Met Glu Tyr Ala Asn Gly Gly Glu Leu Phe Phe His Leu Ser 
225 230 235 240 

Arg Glu Arg Val Phe Ser Glu Asp Arg Ala Arg Phe Tyr Gly Ala Glu 

245 250 255 

He Val Ser Ala Leu Asp Tyr Leu His Ser Glu Lys Asn Val Val Tyr 

260 265 270 

Arg Asp Leu Lys Leu Glu Asn Leu Met Leu Asp Lys Asp Gly His He 

275 280 285 

Lys He Thr Asp Phe Gly Leu Cys Lys Glu Gly He Lys Asp Gly Ala 

290 295 300 

Thr Met Lys Thr Phe Cys Gly Thr Pro Glu Tyr Leu Ala Pro Glu Val 
305 310 315 320 

Leu Glu Asp Asn Asp Tyr Gly Arg Ala Val Asp Trp Trp Gly Leu Gly 

325 330 335 

Val Val Met Tyr Glu Met Met Cys Gly Arg Leu Pro Phe Tyr Asn Gin 

340 345 350 

Asp His Glu Lys Leu Phe Glu Leu He Leu Met Glu Glu He Arg Phe 

355 360 365 

Pro Arg Thr Leu Gly Pro Glu Ala Lys Ser Leu Leu Ser Gly Leu Leu 

370 375 380 

Lys Lys Asp Pro Lys Gin Arg Leu Gly Gly Gly Ser Glu Asp Ala Lys 
385 390 395 400 

Glu He Met Gin His Arg Phe Phe Ala Gly He Val Trp Gin His Val 

405 410 415 

Tyr Glu Lys Lys Leu Ser Pro Pro Phe Lys Pro Gin Val Thr Ser Glu 

420 425 430 

Thr Asp Thr Arg Tyr Phe Asp Glu Glu Phe Thr Ala Gin Met lie Thr 

435 440 445 

He Thr Pro Pro Asp Gin Asp Asp Ser Met Glu Cys Val Asp Ser Glu 

450 455 460 

Arg Arg Pro His Phe Pro Gin Phe Ser Tyr Ser Ala Ser Ser Thr Ala 
465 470 475 480 

Ser Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe 

485 490 495 

Thr Gly Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly 

500 505 510 

His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly 

515 520 525 

Lys Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pre 



530 



535 



540 



Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser 
545 550 555 560 

Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met 

565 570 575 

Pro Glu Gly Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly 

580 585 590 

Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val 

595 600 605 

Asn Arg lie Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie 

610 615 620 

Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie 
625 630 635 640 

Met Ala Asp Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg 

645 650 655 

His Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin 

660 665 670 

Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr 

675 680 685 

Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp 

690 695 700 

His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly 
705 710 715 720 

Met Asp Glu Leu Tyr Lys 
725 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2751 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 1...2748 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

ATG GOT GAC GTT TAC CCG GCC AAC GAC TCC ACG GCG TCT CAG GAC GTG 48 
Met Ala Asp Val Tyr Pro Ala Asn Asp Ser Thr Ala Ser Gin Asp Val 
15 10 15 

GCC AAC CGC TTC GCC CGC AAA GGG GCG CTG AGG CAG AAG AAC GTG CAT 96 
Ala Asn Arg Phe Ala Arg Lys Gly Ala Leu Arg Gin Lys Asn Val His 



20 



25 



30 



GAG GTG AAA GAC CAC AAA TTC ATC 
Glu Val Lys Asp His Lys Phe He 
35 40 



GCC CGC TTC TTC AAG CAA CCC 
Ala Arg Phe Phe Lys Gin Pro 
45 



ACC 



144 



Thr 



TTC TGC AGC CAC TGC ACC GAC TTC 
Phe Cys Ser His Cys Thr Asp Phe 
50 55 



ATC TGG GGG TTT GGG AAA CAA 
He Trp Gly Phe Gly Lys Gin 
60 



GGC 



192 



Gly 



TTC CAG TGC CAA GTT TGC TGT TTT GTG GTT CAT AAG AGG TGC CAT GAG 2 40 

Phe Gin Cys Gin Val Cys Cys Phe Val Val His Lys Arg Cys His Glu 
65 70 75 80 

TTC GTT ACG TTC TCT TGT CCG GGT GCG GAT AAG GGA CCT GAC ACT GAC 2 88 

Phe Val Thr Phe Ser Cys Pro Gly Ala Asp Lys Gly Pro Asp Thr Asp 
85 90 95 

GAC CCC AGG AGC AAG CAC AAG TTC AAA ATC CAC ACA TAC GGA AGC CCT 336 
Asp Pro Arg Ser Lys His Lys Phe Lys lie His Thr Tyr Gly Ser Pro 
100 105 110 

ACC TTC TGT GAT CAC TGT GGG TCC CTG CTC TAT GGA CTT ATC CAC CAA 3 84 

Thr Phe Cys Asp His Cys Gly Ser Leu Leu Tyr Gly Leu lie His Gin 
115 120 125 

GGG ATG AAA TGT GAC ACC TGC GAC ATG AAT GTT CAC AAC CAG TGT GTG 432 
Gly Met Lys Cys Asp Thr Cys Asp Met Asn Val His Asn Gin Cys Val 
130 135 140 

ATC AAT GAC CCT AGC CTC TGC GGA ATG GAT CAC ACA GAG AAG AGG GGG 480 
He Asn Asp Pro Ser Leu Cys Gly Met Asp His Thr Glu Lys Arg Gly 
145 150 155 160 

CGG ATT TAT CTG AAG GCT GAG GTC ACT GAT GAA AAG CTC CAC GTC ACG 528 
Arg He Tyr Leu Lys Ala Glu Val Thr Asp Glu Lys Leu His Val Thr 
165 170 175 

GTA CGA GAT GCA AAA AAT CTA ATC CCT ATG GAT CCA AAT GGG CTT TCG 57 6 

Val Arg Asp Ala Lys Asn Leu He Pro Met Asp Pro Asn Gly Leu Ser 
180 185 190 

GAT CCT TAT GTG AAG CTG AAA CTA ATC CCT GAC CCC AAG AAT GAG AGC 62 4 

Asp Pro Tyr Val Lys Leu Lys Leu He Pro Asp Pro Lys Asn Glu Ser 
195 200 205 

AAA CAG AAA ACC AAA ACC ATC CGC TCC AAC CTG AAT CCT CAG TGG AAT 67 2 

Lys Gin Lys Thr Lys Thr He Arg Ser Asn Leu Asn Pro Gin Trp Asn 
210 215 220 

GAG TCC TTC ACG TTC AAA TTA AAA CCT TCA GAC AAA GAC CGG CGA CTG 720 
Glu Ser Phe Thr Phe Lys Leu Lys Pro Ser Asp Lys Asp Arg Arg Leu 
225 230 235 240 

TCT GTA GAA ATC TGG GAC TGG GAT CGG ACG ACT CGG AAT GAC TTC ATG 768 
Ser Val Glu He Trp Asp Trp Asp Arg Thr Thr Arg Asn Asp Phe Met 
245 250 255 

GGA TCC CTT TCC TTT GGT GTC TCA GAG CTA ATG AAG ATG CCG GCC AGT 816 
Gly Ser Leu Ser Phe Gly Val Ser Glu Leu Met Lys Met Pro Ala Ser 
260 265 270 

GGA TOG TAT AAA GCT CAC AAC CAA GAA GAG GGC GAA TAT TAC AAC GTG 864 
Gly Trp Tyr Lys Ala His Asn Gin Glu Glu Gly Glu Tyr Tyr Asn Val 
275 280 285 

CCC ATT CCA GAA GGA GAT GAA GAA GGC AAC ATG GAA CTC AGG CAG AAG 912 



Pro lie Pro Glu Gly Asp Glu Glu Gly Asn Met Glu Leu Arg Gin Lys 
290 295 300 

TTT GAG AAA GCC AAG CTA GGT CCT GTT GGT AAC AAA GTC ATC AGC CCT 960 

Phe Glu Lys Ala Lys Leu Gly Pro Val Gly Asn Lys Val lie Ser Pro 
305 310 315 320 

TCA GAA GAC AGA AAG CAA CCA TCC AAC AAC CTG GAC AGA GTG AAA CTC 1008 

Ser Glu Asp Arg Lys Gin Pro Ser Asn Asn Leu Asp Arg Val Lys Leu 
325 330 335 

ACA GAC TTC AAC TTC CTC ATG GTG CTG GGG AAG GGG AGT TTT GGG AAG 1056 
Thr Asp Phe Asn Phe Leu Met Val Leu Gly Lys Gly Ser Phe Gly Lys 
340 345 350 

GTG ATG CTT GCT GAC AGG AAG GGA ACG GAG GAA CTG TAC GCC ATC AAG 1104 

Val Met Leu Ala Asp Arg Lys Gly Thr Glu Glu Leu Tyr Ala lie Lys 

355 360 365 

ATC CTG AAG AAG GAC GTG GTG ATC CAG GAC GAC GAC GTG GAG TGC ACC 1152 

lie Leu Lys Lys Asp Val Val He Gin Asp Asp Asp Val Glu Cys Thr 
370 375 380 

ATG GTG GAG AAG CGC GTG CTG GCC CTG CTG GAC AAG CCG CCA TTT CTG 1200 

Met Val Glu Lys Arg Val Leu Ala Leu Leu Asp Lys Pro Pro Phe Leu 
385 390 395 400 

ACA CAG CTG CAC TCC TGC TTC CAG ACA GTG GAC CGG CTG TAC TTC GTC 1248 

Thr Gin Leu His Ser Cys Phe Gin Thr Val Asp Arg Leu Tyr Phe Val 
405 410 415 

ATG GAA TAC GTC AAC GGC GGG GAT CTT ATG TAC CAC ATT CAG CAA GTC 1296 

Met Glu Tyr Val Asn Gly Gly Asp Leu Met Tyr His He Gin Gin Val 
420 425 430 

GGG AAA TTT AAG GAG CCA CAA GCA GTA TTC TAC GCA GCC GAG ATC TCC 1344 

Gly Lys Phe Lys Glu Pro Gin Ala Val Phe Tyr Ala Ala Glu He Ser 

435 440 445 

ATC GGA CTG TTC TTC CTT CAT AAA AGA GGG ATC ATT TAC AGG GAT CTG 1392 

He Gly Leu Phe Phe Leu His Lys Arg Gly He He Tyr Arg Asp Leu 
450 455 460 

AAG CTG AAC AAT GTC ATG CTG AAC TCA GAA GGG CAC ATC AAA ATC GCC 1440 

Lys Leu Asn Asn Val Met Leu Asn Ser Glu Gly His lie Lys He Ala 
465 470 475 480 

GAC TTC GGG ATG TGC AAG GAA CAC ATG ATG GAT GGA GTC ACG ACC AGG 1488 

Asp Phe Gly Met Cys Lys Glu His Met Met Asp Gly Val Thr Thr Arg 
485 490 495 

ACC TTC TGC GGA ACT CCG GAC TAC ATT GCC CCA GAG ATA ATC GCT TAC 1536 

Thr Phe Cys Gly Thr Pro Asp Tyr He Ala Pro Glu He He Ala Tyr 
500 505 510 

CAG CCG TAC GGG AAG TCT GTA GAT TGG TGG GCG TAC GGT GTG CTG CTG 1584 

Gin Pro Tyr Gly Lys Ser Val Asp Trp Trp Ala Tyr Gly Val Leu Leu 

515 520 525 



TAC GAG ATG CTA GCC GGG CAG CCT CCG TTT GAT GGT GAA GAT GAA GAT 163 2 
Tyr Glu Met Leu Ala Gly Gin Pro Pro Phe Asp Gly Glu Asp Glu Asp 
530 535 540 

GAA CTG TTT CAG TCT ATA ATG GAG CAC AAC GTG TCC TAC CCC AAA TCC 1680 
Glu Leu Phe Gin Ser He Met Glu His Asn Val Ser Tyr Pro Lys Ser 
545 550 555 560 

TTG TCC AAG GAA GCC GTC TCC ATC TGC AAA GGA CTT ATG ACC AAA CAG 1728 
Leu Ser Lys Glu Ala Val Ser He Cys Lys Gly Leu Met Thr Lys Gin 
565 570 575 

CCT GCC AAG CGA CTG GGC TGC GGG CCC GAG GGA GAG AGG GAT GTC AGA 1776 
Pro Ala Lys Arg Leu Gly Cys Gly Pro Glu Gly Glu Arg Asp Val Arg 
580 585 590 

GAG CAT GCC TTC TTC AGG AGG ATC GAC TGG GAG AAA CTG GAG AAC AGG 1824 
Glu His Ala Phe Phe Arg Arg He Asp Trp Glu Lys Leu Glu Asn Arg 
595 600 605 

GAG ATC CAA CCA CCA TTC AAG CCC AAA GTG TGT GGC AAA GGA GCA GAA 1872 
Glu He Gin Pro Pro Phe Lys Pro Lys Val Cys Gly Lys Gly Ala Glu 
610 615 620 

AAC TTT GAC AAG TTC TTC ACG CGA GGA CAG CCT GTC TTA ACA CCA CCA 1920 
Asn Phe Asp Lys Phe Phe Thr Arg Gly Gin Pro Val Leu Thr Pro Pro 
625 630 635 640 

GAT CAG CTG GTC ATT GCT AAC ATA GAC CAA TCT GAT TTT GAA GGG TTC 1968 
Asp Gin Leu Val He Ala Asn He Asp Gin Ser Asp Phe Glu Gly Phe 
645 650 655 

TCG TAT GTC AAC CCC CAG TTT GTG CAC CCA ATC TTG CAA AGT GCA GTA 2 016 
Ser Tyr Val Asn Pro Gin Phe Val His Pro He Leu Gin Ser Ala Val 
660 665 670 

GGG CGC GCC ATG AGT AAA GGA GAA GAA CTT TTC ACT GGA GTT GTC CCA 2064 
Gly Arg Ala Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro 
675 680 685 

ATT CTT GTT GAA TTA GAT GGC GAT GTT AAT GGG CAA AAA TTC TCT GTT 2112 
He Leu Val Glu Leu Asp Gly Asp Val Asn Gly Gin Lys Phe Ser Val 
690 695 700 

AGT GGA GAG GGT GAA GGT GAT GCA ACA TAC GGA AAA CTT ACC CTT AAA 2160 
Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys 
705 710 715 720 

TTT ATT TGC ACT ACT GGG AAG CTA CCT GTT CCA TGG CCA ACG CTT GTC 22 08 
Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pre Thr Leu Val 
725 730 735 

ACT ACT CTC ACT TAT GGT GTT CAA TGC TTT TCT AGA TAC CCA GAT CAT 2 2 56 
Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His 
740 745 750 

ATG AAA CAG CAT GAC TTT TTC AAG AGT GCC ATG CCC GAA GGT TAT GTA 2 304 



Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val 
755 760 765 

CAG GAA AGA ACT ATA TTT TAC AAA GAT GAC GGG AAC TAC AAG AC A CGT 23 52 
Gin Glu Arg Thr He Phe Tyr Lys Asp Asp Gly Asn Tyr Lys Thr Arg 
770 775 780 

GCT GAA GTC AAG TTT GAA GGT GAT ACC CTT GTT AAT AGA ATC GAG TTA 24 00 
Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu 
785 790 795 800 

AAA GGT ATT GAT TTT AAA GAA GAT GGA AAC ATT CTT GGA CAC AAA ATG 2448 
Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Met 
805 810 815 

GAA TAC AAT TAT AAC TCA CAT AAT GTA TAC ATC ATG GCA GAC AAA CCA 2496 
Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Pro 
820 825 830 

AAG AAT GGC ATC AAA GTT AAC TTC AAA ATT AGA CAC AAC ATT AAA GAT 2544 
Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He Lys Asp 
835 840 845 

GGA AGC GTT CAA TTA GCA GAC CAT TAT CAA CAA AAT ACT CCA ATT GGC 2592 
Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly 
850 855 860 

GAT GGC CCT GTC CTT TTA CCA GAC AAC CAT TAC CTG TCC ACG CAA TCT 2 640 

Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser 
865 870 875 880 

GCC CTT TCC AAA GAT CCC AAC GAA AAG AGA GAT CAC ATG ATC CTT CTT 2688 
Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met He Leu Leu 
885 890 895 

GAG TTT GTA ACA GCT GCT GGG ATT ACA CAT GGC ATG GAT GAA CTA TAC 2736 
Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr 
900 905 910 

AAA CCT CAG GAG TAA 2151 
Lys Pro Gin Glu 
915 



(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 916 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
Met Ala Asp Val Tyr Pro Ala Asn Asp Ser Thr Ala Ser Gin Asp Val 



15 10 15 

Ala Asn Arg Fhe Ala Arg Lys Gly Ala Leu Arg Gin Lys Asn Val His 

20 25 30 

Glu Val Lys Asp His Lys Phe lie Ala Arg Phe Phe Lys Gin Pro Thr 

35 40 45 

Phe Cys Ser His Cys Thr Asp Phe He Trp Gly Phe Gly Lys Gin Gly 

50 55 60 

Phe Gin Cys Gin Val Cys Cys Phe Val Val His Lys Arg Cys His Glu 
65 70 75 80 

Phe Val Thr Phe Ser Cys Pro Gly Ala Asp Lys Gly Pro Asp Thr Asp 

85 90 95 

Asp Pro Arg Ser Lys His Lys Phe Lys He His Thr Tyr Gly Ser Pro 

100 105 HO 

Thr Phe Cys Asp His Cys Gly Ser Leu Leu Tyr Gly Leu He His Gin 

115 120 125 

Gly Met Lys Cys Asp Thr Cys Asp Met Asn Val His Asn Gin Cys Val 

130 135 140 

He Asn Asp Pro Ser Leu Cys Gly Met Asp His Thr Glu Lys Arg Gly 
145 150 155 160 

Arg He Tyr Leu Lys Ala Glu Val Thr Asp Glu Lys Leu His Val Thr 

165 170 175 

Val Arg Asp Ala Lys Asn Leu He Pro Met Asp Pro Asn Gly Leu Ser 

180 185 190 

Asp Pro Tyr Val Lys Leu Lys Leu He Pro Asp Pro Lys Asn Glu Ser 

195 200 205 

Lys Gin Lys Thr Lys Thr He Arg Ser Asn Leu Asn Pro Gin Trp Asn 

210 215 220 

Glu Ser Phe Thr Phe Lys Leu Lys Pro Ser Asp Lys Asp Arg Arg Leu 
225 230 235 240 

Ser Val Glu He Trp Asp Trp Asp Arg Thr Thr Arg Asn Asp Phe Met 

245 250 255 

Gly Ser Leu Ser Phe Gly Val Ser Glu Leu Met Lys Met Pro Ala Ser 

260 265 270 

Gly Trp Tyr Lys Ala His Asn Gin Glu Glu Gly Glu Tyr Tyr Asn Val 

275 280 285 

Pro He Pro Glu Gly Asp Glu Glu Gly Asn Met Glu Leu Arg Gin Lys 

290 295 300 

Phe Glu Lys Ala Lys Leu Gly Pro Val Gly Asn Lys Val He Ser Pro 
305 310 315 320 

Ser Glu Asp Arg Lys Gin Pro Ser Asn Asn Leu Asp Arg Val Lys Leu 

325 330 335 

Thr Asp Phe Asn Phe Leu Met Val Leu Gly Lys Gly Ser Phe Gly Lys 

340 345 350 

Val Met Leu Ala Asp Arg Lys Gly Thr Glu Glu Leu Tyr Ala He Lys 

355 360 365 

He Leu Lys Lys Asp Val Val He Gin Asp Asp Asp Val Glu Cys Thr 

370 375 380 

Met Val Glu Lys Arg Val Leu Ala Leu Leu Asp Lys Pro Pro Phe Leu 
385 390 395 400 

Thr Gin Leu His Ser Cys Phe Gin Thr Val Asp Arg Leu Tyr Phe Val 

405 410 415 

Met Glu Tyr Val Asn Gly Gly Asp Leu Met Tyr His He Gin Gin Val 

420 425 430 

Gly Lys Phe Lys Glu Pro Gin Ala Val Phe Tyr Ala Ala Glu He Ser 

435 440 445 

He Gly Leu Phe Phe Leu His Lys Arg Gly He lie Tyr Arg Asp Leu 

450 455 460 

Lys Leu Asn Asn Val Met Leu Asn Ser Glu Gly His He Lys He Ala 
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465 470 475 480 

Asp Phe Gly Met Cys Lys Glu His Met Met Asp Gly Val Thr Thr Arg 

485 490 495 

Thr Phe Cys Gly Thr Pro Asp Tyr lie Ala Pro Glu lie He Ala Tyr 

500 505 510 

Gin Pro Tyr Gly Lys Ser Val Asp Trp Trp Ala Tyr Gly Val Leu Leu 

515 520 525 

Tyr Glu Met Leu Ala Gly Gin Pro Pro Phe Asp Gly Glu Asp Glu Asp 

530 535 540 

Glu Leu Phe Gin Ser He Met Glu His Asn Val Ser Tyr Pro Lys Ser 
545 550 555 560 

Leu Ser Lys Glu Ala Val Ser He Cys Lys Gly Leu Met Thr Lys Gin 

565 570 575 

Pro Ala Lys Arg Leu Gly Cys Gly Pro Glu Gly Glu Arg Asp Val Arg 

580 585 590 

Glu His Ala Phe Phe Arg Arg He Asp Trp Glu Lys Leu Glu Asn Arg 

595 600 605 

Glu He Gin Pro Pro Phe Lys Pro Lys Val Cys Gly Lys Gly Ala Glu 

610 615 620 

Asn Phe Asp Lys Phe Phe Thr Arg Gly Gin Pro Val Leu Thr Pro Pro 
625 630 635 640 

Asp Gin Leu Val He Ala Asn He Asp Gin Ser Asp Phe Glu Gly Phe 

645 650 655 

Ser Tyr Val Asn Pro Gin Phe Val His Pro He Leu Gin Ser Ala Val 

660 665 670 

Gly Arg Ala Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro 

675 680 685 

He Leu Val Glu Leu Asp Gly Asp Val Asn Gly Gin Lys Phe Ser Val 

690 695 700 

Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys 
705 710 715 720 

Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val 

725 730 735 

Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His 

740 745 750 

Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val 

755 760 765 

Gin Glu Arg Thr He Phe Tyr Lys Asp Asp Gly Asn Tyr Lys Thr Arg 

770 775 780 

Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu 
785 790 795 800 

Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Met 

805 810 815 

Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Pro 

820 825 830 

Lys Asn Gly lie Lys Val Asn Phe Lys He Arg His Asn He Lys Asp 

835 840 845 

Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly 

850 855 860 

Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser 
865 870 875 880 

Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met He Leu Leu 

885 890 895 

Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr 

900 905 910 

Lys Pro Gin Glu 
915 



(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...2154 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

ATG TCG TCC ATC TTG CCA TTC ACG CCG CCA GTT GTG AAG AGA CTG CTG 48 
Met Ser Ser lie Leu Pro Phe Thr Pro Pro Val Val Lys Arg Leu Leu 
15 10 15 

GGA TGG AAG AAG TCA GCT GGT GGG TCT GGA GGA GCA GGC GGA GGA GAG 96 
Gly Trp Lys Lys Ser Ala Gly Gly Ser Gly Gly Ala Gly Gly Gly Glu 
20 25 30 

CAG AAT GGG CAG GAA GAA AAG TGG TGT GAG AAA GCA GTG AAA AGT CTG 144 
Gin Asn Gly Gin Glu Glu Lys Trp Cys Glu Lys Ala Val Lys Ser Leu 
35 40 45 

GTG AAG AAG CTA AAG AAA ACA GGA CGA TTA GAT GAG CTT GAG AAA GCC 192 
Val Lys Lys Leu Lys Lys Thr Gly Arg Leu Asp Glu Leu Glu Lys Ala 
50 55 60 

ATC ACC ACT CAA AAC TGT AAT ACT AAA TGT GTT ACC ATA CCA AGC ACT 240 
He Thr Thr Gin Asn Cys Asn Thr Lys Cys Val Thr He Pro Ser Thr 
65 70 75 80 

TGC TCT GAA ATT TGG GGA CTG AGT ACA CCA AAT ACG ATA GAT CAG TGG 2 88 

Cys Ser Glu He Trp Gly Leu Ser Thr Pro Asn Thr He Asp Gin Trp 
85 90 95 

GAT ACA ACA GGC CTT TAC AGC TTC TCT GAA CAA ACC AGG TCT CTT GAT 336 
Asp Thr Thr Gly Leu Tyr Ser Phe Ser Glu Gin Thr Arg Ser Leu Asp 
100 105 110 

GGT CGT CTC CAG GTA TCC CAT CGA AAA GGA TTG CCA CAT GTT ATA TAT 3 84 

Gly Arg Leu Gin Val Ser His Arg Lys Gly Leu Pro His Val lie Tyr 
115 120 125 

TGC CGA TTA TGG CGC TGG CCT GAT CTT CAC AGT CAT CAT GAA CTC AAG 432 
Cys Arg Leu Trp Arg Trp Pro Asp Leu His Ser His His Glu Leu Lys 
130 135 140 

GCA ATT GAA AAC TGC GAA TAT G^T TTT AAT CTT AAA AAG GAT GAA GTA 4 80 

Ala He Glu Asn Cys Glu Tyr Ala Phe Asn Leu Lys Lys Asp Glu Val 
145 150 155 160 

TGT GTA AAC CCT TAC CAC TAT CAG AGA GTT GAG ACA CCA GTT TTG CCT 52 8 
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Cys Val Asn Pro Tyr His Tyr Gin Arg Val Glu Thr Pro Val Leu Pro 
165 170 175 

CCA GTA TTA GTG CCC CGA CAC ACC GAG ATC CTA ACA GAA CTT CCG CCT 576 
Pro Val Leu Val Pro Arg His Thr Glu lie Leu Thr Glu Leu Pro Pre 
180 185 190 

CTG GAT GAC TAT ACT CAC TCC ATT CCA GAA AAC ACT AAC TTC CCA GCA 624 
Leu Asp Asp Tyr Thr His Ser lie Pro Glu Asn Thr Asn Phe Pro Ala 
195 200 205 

GGA ATT GAG CCA CAG AGT AAT TAT ATT CCA GAA ACG CCA CCT CCT GGA 67 2 

Gly He Glu Pro Gin Ser Asn Tyr He Pro Glu Thr Pro Pro Pro Gly 
210 215 220 

TAT ATC AGT GAA GAT GGA GAA ACA AGT GAC CAA CAG TTG AAT CAA AGT 720 
Tyr He Ser Glu Asp Gly Glu Thr Ser Asp Gin Gin Leu Asn Gin Ser 
225 230 235 240 

ATG GAC ACA GGC TCT CCA GCA GAA CTA TCT CCT ACT ACT CTT TCC CCT 7 68 

Met Asp Thr Gly Ser Pro Ala Glu Leu Ser Pro Thr Thr Leu Ser Pro 
245 250 255 

GTT AAT CAT AGC TTG GAT TTA CAG CCA GTT ACT TAC TCA GAA CCT GCA 816 
Val Asn His Ser Leu Asp Leu Gin Pro Val Thr Tyr Ser Glu Pro Ala 
260 265 270 

TTT TGG TGT TCA ATA GCA TAT TAT GAA TTA AAT CAG AGG GTT GGA GAA 864 
Phe Trp Cys Ser He Ala Tyr Tyr Glu Leu Asn Gin Arg Val Gly Glu 

275 280 285 

ACC TTC CAT GCA TCA CAG CCC TCA CTC ACT GTA GAT GGC TTT ACA GAC 912 
Thr Phe His Ala Ser Gin Pro Ser Leu Thr Val Asp Gly Phe Thr Asp 
290 295 300 

CCA TCA AAT TCA GAG AGG TTC TGC TTA GGT TTA CTC TCC AAT GTT AAC 960 
Pro Ser Asn Ser Glu Arg Phe Cys Leu Gly Leu Leu Ser Asn Val Asn 
305 310 315 320 

CGA AAT GCC ACG GTA GAA ATC ACA AGA AGG CAT ATA GGA AGA GGA GTG 1008 
Arg Asn Ala Thr Val Glu Met Thr Arg Arg His He Gly Arg Gly Val 
325 330 335 

CGC TTA TAC TAC ATA GGT GGG GAA GTT TTT GCT GAG TGC CTA AGT GAT 1056 
Arg Leu Tyr Tyr He Gly Gly Glu Val Phe Ala Glu Cys Leu Ser Asp 
340 345 350 

AGT GCA ATC TTT GTG CAG AG 2 CCC AAT TGT AAT CAG AGA TAT GGC TGG 1104 
Ser Ala He Phe Val Gin Ser Pro Asn Cys Asn Gin Arg Tyr Gly Trp 
355 360 365 

CAC CCT GCA ACA GTG TGT AAA ATT CCA CCA GGC TGT AAT CTG AAG ATC 1152 
His Pro Ala Thr Val Cys Lys He Pro Pro Gly Cys Asn Leu Lys He 
370 375 380 

TTC AAC AAC CAG GAA TTT GCT GCT CTT CTG GCT CAG TCT GTT AAT CAG 12 00 

Phe Asn Asn Gin Glu Phe Ala Ala Leu Leu Ala Gin Ser Val Asn Gin 
385 390 395 400 



GGT TTT GAA GCC GTC TAT CAG CTA ACT AGA ATG TGC ACC ATA AGA ATG 1248 
Gly Phe Glu Ala Val Tyr Gin Leu Thr Arg Met Cys Thr lie Arg Met 
405 410 415 

AGT TTT GTG AAA GGG TGG GGA GCA GAA TAC CGA AGG CAG ACG GTA ACA 12 96 
Ser Phe Val Lys Gly Trp Gly Ala Glu Tyr Arg Arg Gin Thr Val Thr 
420 425 430 

AGT ACT CCT TGC TGG ATT GAA CTT CAT CTG AAT GGA CCT CTA CAG TGG 13 44 
Ser Thr Pro Cys Trp lie Glu Leu His Leu Asn Gly Pro Leu Gin Trp 
435 440 445 

TTG GAC AAA GTA TTA ACT CAG ATG GGA TCC CCT TCA GTG CGT TGC TCA 13 92 
Leu Asp Lys Val Leu Thr Gin Met Gly Ser Pro Ser Val Arg Cys Ser 
450 455 460 

AGC ATG TCA TGG GTA CCG CGG GCC CGG GAT CCA CCG GTC GCC ACC ATG 1440 
Ser Met Ser Trp Val Pro Arg Ala Arg Asp Pro Pro Val Ala Thr Met 
465 470 475 480 

GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG GTC 1488 
Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 
485 490 495 

GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC GAG 1536 
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
500 505 510 

GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC TGC 1584 
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys 
515 520 525 

ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC CTG 1632 
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 
530 535 540 

ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG CAG 1680 
Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
545 550 555 560 

CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG CGC 1728 
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
565 570 575 

ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG GTG 1776 
Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
580 585 590 

AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC ATC 1824 
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie 
595 600 605 

GAC TTC AAG GAG GAC GGC AAC ATC CTC GGG CAC AAG CTG GAG TAC AAC 1872 
Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
610 615 620 

TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC GGC 1920 



Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
625 630 635 640 

ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC GTG 1968 
He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 
645 650 655 

CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC CCC 2016 
Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 
660 665 670 

GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG AGC 2064 
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
675 680 685 

AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC GTG 2112 
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
690 695 700 

ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TAA 2157 
Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
705 710 715 



(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 718 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Met Ser Ser He Leu Pro Phe Thr Pro Pro Val Val Lys Arg Leu Leu 

15 10 15 

Gly Trp Lys Lys Ser Ala Gly Gly Ser Gly Gly Ala Gly Gly Gly Glu 

20 25 30 

Gin Asn Gly Gin Glu Glu Lys Trp Cys Glu Lys Ala Val Lys Ser Leu 

35 40 45 

Val Lys Lys Leu Lys Lys Thr Gly Arg Leu Asp Glu Leu Glu Lys Ala 

50 55 60 

lie Thr Thr Gin Asn Cys Asn Thr Lys Cys Val Thr lie Pro Ser Thr 
65 70 75 80 

Cys Ser Glu lie Trp Gly Leu Ser Thr Pro Asn Thr lie Asp Gin Trp 

85 90 95 

Asp Thr Thr Gly Leu Tyr Ser Phe Ser Glu Gin Thr Arg Ser Leu Asp 

100 105 110 

Gly Arg Leu Gin Val Ser His Arg Lys Gly Leu Pro His Val He Tyr 

115 120 125 

Cys Arg Leu Trp Arg Trp Pro Asp Leu His Ser His His Glu Leu Lys 

130 135 140 

Ala lie Glu Asn Cys Glu Tyr Ala Phe Asn Leu Lys Lys Asp Glu Val 
145 150 155 160 

Cys Val Asn Pro Tyr His Tyr Gin Arg Val Glu Thr Pro Val Leu Pro 
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Pro Val Leu Val 
180 

Leu Asp Asp Tyr 
195 

Gly He Glu Pro 
210 

Tyr He Ser Glu 
225 

Met Asp Thr Gly 

Val Asn His Ser 
260 

Phe Trp Cys Ser 
275 

Thr Phe His Ala 
290 

Pro Ser Asn Ser 
305 

Arg Asn Ala Thr 

Arg Leu Tyr Tyr 
340 

Ser Ala He Phe 
355 

His Pro Ala Thr 

370 

Phe Asn Asn Gin 
385 

Gly Phe Glu Ala 

Ser Phe Val Lys 
420 

Ser Thr Pro Cys 
435 

Leu Asp Lys Val 
450 

Ser Met Ser Trp 
465 

Val Ser Lys Gly 

Glu Leu Asp Gly 
SCC- 
dy Glu Gly Asp 
515 

Thr Thr Gly Lys 
530 

Thr Tyr Gly Val 
545 

His Asp Phe Phe 

Thr He Phe Phe 
5 80 

Lys Phe Glu Gly 
595 

Asp Phe Lys Glu 
610 

Tyr Asn Ser His 
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Pro Arg His Thr 

Thr His Ser He 
200 

Gin Ser Asn Tyr 
215 

Asp Gly Glu Thr 

230 

Ser Pro Ala Glu 
245 

Leu Asp Leu Gin 

He Ala Tyr Tyr 
280 

Ser Gin Pro Ser 
295 

Glu Arg Phe Cys 
310 

Val Glu Met Thr 
325 

He Gly Gly Glu 

Val Gin Ser Pro 
360 

Val Cys Lys He 
375 

Glu Phe Ala Ala 
390 

Val Tyr Gin Leu 
405 

Gly Trp Gly Ala 

Trp He Glu Leu 
440 

Leu Thr Gin Met 
455 

Val Pro Arg Ala 
470 

Glu Glu Leu Phe 
485 

Asp Val Asn Gly 

Ala Thr Tyr Gly 
520 

Leu Pro Val Pro 
535 

Gin Cys Phe Ser 
550 

Lys Ser Ala Met 
565 

Lys Asp Asp Gly 

Asp Thr Leu Val 
600 

Asp Gly Asn lie 
615 

Asn Val Tyr He 



170 

Glu He Leu Thr 
185 

Pro Glu Asn Thr 

He Pro Glu Thr 
220 

Ser Asp Gin Gin 
235 

Leu Ser Pro Thr 
250 

Pro Val Thr Tyr 
265 

Glu Leu Asn Gin 

Leu Thr Val Asp 
300 

Leu Gly Leu Leu 
315 

Arg Arg His lie 
330 

Val Phe Ala Glu 
345 

Asn Cys Asn Gin 

Pro Pro Gly Cys 
380 

Leu Leu Ala Gin 
395 

Thr Arg Met Cys 
410 

Glu Tyr Arg Arg 
425 

His Leu Asn Gly 

Gly Ser Pro Ser 
460 

Arg Asp Pro Pro 
475 

Thr Gly Val Val 
490 

His Lys Phe Ser 
505 

Lys Leu Thr Leu 

Trp Pro Thr Leu 
540 

Arg Tyr Pro Asp 
555 

Pro Glu Gly Tyr 
570 

Asn Tyr Lys Thr 
585 

Asn Arg He Glu 

Leu Gly His Lys 
620 

Met Ala Asp Lys 
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Glu Leu Pro Pro 
190 

Asn Phe Pro Ala 
205 

Pro Pro Pro Gly 

Leu Asn Gin Ser 
240 

Thr Leu Ser Pro 
255 

Ser Glu Pro Ala 
270 

Arg Val Gly Glu 
285 

Gly Phe Thr Asp 

Ser Asn Val Asn 
320 

Gly Arg Gly Val 
335 

Cys Leu Ser Asp 
350 

Arg Tyr Gly Trp 
365 

Asn Leu Lys He 

Ser Val Asn Gin 
400 

Thr lie Arg Met 
415 

Gin Thr Val Thr 
430 

Pro Leu Gin Trp 

445 

Val Arg Cys Ser 

Val Ala Thr Met 
480 

Pro He Leu Val 
495 

Val Ser Gly Glu 
510 

Lys Phe He Cys 
525 

Val Thr Thr Leu 

His Met Lys Gin 
560 

Val Gin Glu Arg 
575 

Arg Ala Glu Val 
590 

Leu Lys Gly He 

605 

Leu Glu Tyr Asn 
Gin Lys Asn Gly 



625 630 635 640 

He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 

645 650 655 

Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly Pro 

660 665 670 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 

675 680 685 

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 

690 695 700 

Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
705 710 715 



(2) INFORMATION FOR SEQ ID NO: 76: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2397 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 



(A) NAME/ KEY : Coding Sequence 

(B) LOCATION: 1 . . .2394 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 



ATG GAC AAT ATG TCT ATT ACG AAT ACA CCA ACA AGT AAT GAT GCC TGT 4 8 

Met Asp Asn Met Ser lie Thr Asn Thr Pro Thr Ser Asn Asp Ala Cys 
15 10 15 



CTG AGC ATT GTG CAT AGT TTG ATG TGC CAT AGA CAA GGT GGA GAG AGT 96 
Leu Ser He Val His Ser Leu Met Cys His Arg Gin Gly Gly Glu Ser 
20 25 30 



GAA ACA TTT GCA AAA AGA GCA ATT GAA AGT TTG GTA AAG AAG CTG AAG 144 
Glu Thr Phe Ala Lys Arg Ala He Glu Ser Leu Val Lys Lys Leu Lys 
35 40 45 



GAG AAA AAA GAT GAA TTG GAT TCT TTA ATA ACA GCT ATA ACT ACA AAT 192 
Glu Lys Lys Asp Glu Leu Asp Ser Leu lie Thr Ala lie Thr Thr Asn 
50 55 60 



GGA GCT CAT CCT AGT AAA TGT GTT ACC ATA CAG AGA ACA TTG GAT GGG 24 0 

Gly Ala His Pro Ser Lys Cys Val Thr He Gin Arg Thr Leu Asp Gly 
65 70 75 80 



AGG CTT CAG GTG GCT GGT CGG AAA GGA TTT CCT CAT GTG ATC TAT GCC 2 88 

Arg Leu Gin Val Ala Gly Arg Lys Gly Phe Pro His Val lie Tyr Ala 
85 90 95 



CGT CTC TGG AGG TGG CCT GAT CTT CAC AAA AAT GAA CTA AAA CAT GTT 336 
Arg Leu Trp Arg Trp Pro Asp Leu His Lys Asn Glu Leu Lys His Val 
100 105 110 



AAA TAT TGT CAG TAT GCG TTT GAC TTA AAA TGT GAT AGT GTC TGT GTG 3 84 
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Lys Tyr Cys Gin Tyr Ala Phe Asp Leu Lys Cys Asp Ser Val Cys Val 
115 120 125 

AAT CCA TAT CAC TAC GAA CGA GTT GTA TCA CCT GGA ATT GAT CTC TCA 432 
Asn Pro Tyr His Tyr Glu Arg Val Val Ser Pro Gly He Asp Leu Ser 
130 135 140 

GGA TTA ACA CTG CAG AGT AAT GCT CCA TCA AGT ATG ATG GTG AAG GAT 480 
Gly Leu Thr Leu Gin Ser Asn Ala Pro Ser Ser Met Met Val Lys Asp 
145 150 155 160 

GAA TAT GTG CAT GAC TTT GAG GGA CAG CCA TCG TTG TCC ACT GAA GGA 528 
Glu Tyr Val His Asp Phe Glu Gly Gin Pro Ser Leu Ser Thr Glu Gly 
165 170 175 

CAT TCA ATT CAA ACC ATC CAG CAT CCA CCA AGT AAT CGT GCA TCG ACA 57 6 

His Ser He Gin Thr He Gin His Pro Pro Ser Asn Arg Ala Ser Thr 
180 185 190 

GAG ACA TAC AGC ACC CCA GCT CTG TTA GCC CCA TCT GAG TCT AAT GCT 624 
Glu Thr Tyr Ser Thr Pro Ala Leu Leu Ala Pro Ser Glu Ser Asn Ala 
195 % 200 205 

ACC AGC ACT GCC AAC TTT CCC AAC ATT CCT GTG GCT TCC ACA AGT CAG 672 
Thr Ser Thr Ala Asn Phe Pro Asn He Pro Val Ala Ser Thr Ser Gin 
210 215 220 

CCT GCC AGT ATA CTG GGG GGC AGC CAT AGT GAA GGA CTG TTG CAG ATA 720 
Pro Ala Ser He Leu Gly Gly Ser His Ser Glu Gly Leu Leu Gin He 
225 230 235 240 

GCA TCA GGG CCT CAG CCA GGA CAG CAG CAG AAT GGA TTT ACT GGT CAG 7 68 

Ala Ser Gly Pro Gin Pro Gly Gin Gin Gin Asn Gly Phe Thr Gly Gin 
245 250 255 

CCA GCT ACT TAC CAT CAT AAC AGC ACT ACC ACC TGG ACT GGA AGT AGG 816 
Pro Ala Thr Tyr His His Asn Ser Thr Thr Thr Trp Thr Gly Ser Arg 
260 265 270 

ACT GCA CCA TAC ACA CCT AAT TTG CCT CAC CAC CAA AAC GGC CAT CTT 864 
Thr Ala Pro Tyr Thr Pro Asn Leu Pro His His Gin Asn Gly His Leu 
275 280 285 

CAG CAC CAC CCG CCT ATG CCG CCC CAT CCC GGA CAT TAC TGG CCT GTT 912 
Gin His His Pro Pro Met Pro Pro His Pro Gly His Tyr Trp Pro Val 
290 295 300 

CAC AAT GAG CTT GCA TTC CAG CCT CCC ATT TCC AAT CAT CCT GCT CCT 960 
His Asn Glu Leu Ala Phe Gin Pro Pro He Ser Asn His Pro Ala Pro 
305 310 315 320 

GAG TAT TGG TGT TCC ATT GCT TAC TTT GAA ATG GAT GTT CAG GTA GGA 1008 
Glu Tyr Trp Cys Ser He Ala Tyr Phe Glu Met Asp Val Gin Val Gly 
325 330 335 

GAG ACA TTT AAG GTT CCT TCA AGC TGC CCT ATT GTT ACT GTT GAT GGA 1056 
Glu Thr Phe Lys Val Pro Ser Ser Cys Pro He Val Thr Val Asp Gly 
340 345 350 
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TAC GTG GAC CCT TCT GGA GGA GAT CGC TTT TGT TTG GGT CAA CTC TCC 1104 
Tyr Val Asp Pro Ser Gly Gly Asp Arg Phe Cys Leu Gly Gin Leu Ser 
355 360 365 

AAT GTC CAC AGG ACA GAA GCC ATT GAG AGA GCA AGG TTG CAC ATA GGC 1152 
Asn Val His Arg Thr Glu Ala lie Glu Axg Ala Arg Leu His lie Gly 
370 375 380 

AAA GGT GTG CAG TTG GAA TGT AAA GGT GAA GGT GAT GTT TGG GTC AGG 12 00 

Lys Gly Val Gin Leu Glu Cys Lys Gly Glu Gly Asp Val Trp Val Arg 
385 390 395 400 

TGC CTT AGT GAC CAC GCG GTC TTT GTA CAG AGT TAC TAC TTA GAC AGA 1248 
Cys Leu Ser Asp His Ala Val Phe Val Gin Ser Tyr Tyr Leu Asp Arg 
405 410 415 

GAA GCT GGG CGT GCA CCT GGA GAT GCT GTT CAT AAG ATC TAC CCA AGT 1296 
Glu Ala Gly Arg Ala Pro Gly Asp Ala Val His Lys He Tyr Pro Ser 
420 425 430 

GCA TAT ATA AAG GTC TTT GAT TTG CGT CAG TGT CAT CGA CAG ATG CAG 1344 
Ala Tyr He Lys Val Phe Asp Leu Arg Gin Cys His Arg Gin Met Gin 
435 440 445 

CAG CAG GCG GCT ACT GCA CAA GCT GCA GCA GCT GCC CAG GCA GCA GCC 13 92 
Gin Gin Ala Ala Thr Ala Gin Ala Ala Ala Ala Ala Gin Ala Ala Ala 
450 455 460 

GTG GCA GGA AAC ATC CCT GGC CCA GGA TCA GTA GGT GGA ATA GCT CCA 1440 
Val Ala Gly Asn He Pro Gly Pro Gly Ser Val Gly Gly He Ala Pro 
465 470 475 480 

GCT ATC AGT CTG TCA GCT GCT GCT GGA ATT GGT GTT GAT GAC CTT CGT 1488 
Ala lie Ser Leu Ser Ala Ala Ala Gly He Gly Val Asp Asp Leu Arg 
485 490 495 

CGC TTA TGC ATA CTC AGG ATG AGT TTT GTG AAA GGC TGG GGA CCG GAT 1536 
Arg Leu Cys He Leu Arg Met Ser Phe Val Lys Gly Trp Gly Pro Asp 
500 505 510 

TAC CCA AGA CAG AGC ATC AAA GAA ACA CCT TGC TGG ATT GAA ATT CAC 1584 
Tyr Pro Arg Gin Ser He Lys Glu Thr Pro Cys Trp He Glu lie His 
515 520 525 

TTA CAC CGG GCC CTC CAG CTC CTA GAC GAA GTA CTT CAT ACC ATG CCG 1632 
Leu His Arg Ala Leu Gin Leu Leu Asp Glu Val Leu His Thr Met Pro 
530 535 540 

ATT GCA GAC CCA CAA CCT TTA GAC TGG GAT CCA CCG GTC GCC ACC ATG 1680 
lie Ala Asp Pro Gin Pro Leu Asp Trp Asp Pro Pro Val Ala Thr Met 
545 550 555 560 

GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG GTC 172 8 

Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 
565 570 575 

GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC GAG 17 76 



Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
S80 585 590 

GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC TGC 1824 
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys 
595 600 605 

ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC CTG 1872 
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 
610 615 620 

ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG CAG 1920 
Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
625 630 635 640 

CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG CGC 1968 
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
645 650 655 

ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG GTG 2016 
Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
660 665 670 

AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC ATC 2064 
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie 
675 680 685 

GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC AAC 2112 
Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr Asn 
690 695 700 

TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC GGC 2160 
Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn Gly 
705 710 715 720 

ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC GTG 2208 
lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser Val 
725 730 735 

CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC CCC 22 56 
Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 
740 745 750 

GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG AGC 23 04 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
755 760 765 

AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC GTG 2 3 52 
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
770 775 780 

ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TAA 2 3 97 

Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
785 790 795 



(2) INFORMATION FOR SEQ ID NO: 77: 



//6 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:77: 

Met Asp Asn Met Ser lie Thr Asn Thr Pro Thr Ser Asn Asp Ala Cys 

15 10 15 

Leu Ser lie Val His Ser Leu Met Cys His Arg Gin Gly Gly Glu Ser 

20 25 30 

Glu Thr Phe Ala Lys Arg Ala lie Glu Ser Leu Val Lys Lys Leu Lys 

35 40 45 

Glu Lys Lys Asp Glu Leu Asp Ser Leu lie Thr Ala lie Thr Thr Asn 

50 55 60 

Gly Ala His Pro Ser Lys Cys Val Thr lie Gin Arg Thr Leu Asp Gly 
65 70 75 80 

Arg Leu Gin Val Ala Gly Arg Lys Gly Phe Pro His Val lie Tyr Ala 

85 90 95 

Arg Leu Trp Arg Trp Pro Asp Leu His Lys Asn Glu Leu Lys His Val 

100 105 110 

Lys Tyr Cys Gin Tyr Ala Phe Asp Leu Lys Cys Asp Ser Val Cys Val 

115 120 125 

Asn Pro Tyr His Tyr Glu Arg Val Val Ser Pro Gly lie Asp Leu Ser 

130 135 140 

Gly Leu Thr Leu Gin Ser Asn Ala Pro Ser Ser Met Met Val Lys Asp 
145 150 155 160 

Glu Tyr Val His Asp Phe Glu Gly Gin Pro Ser Leu Ser Thr Glu Gly 

165 170 175 

His Ser lie Gin Thr lie Gin His Pro Pro Ser Asn Arg Ala Ser Thr 

180 185 190 

Glu Thr Tyr Ser Thr Pro Ala Leu Leu Ala Pro Ser Glu Ser Asn Ala 

195 200 205 

Thr Ser Thr Ala Asn Phe Pro Asn lie Pro Val Ala Ser Thr Ser Gin 

210 215 220 

Pro Ala Ser lie Leu Gly Gly Ser His Ser Glu Gly Leu Leu Gin lie 
225 230 235 240 

Ala Ser Gly Pro Gin Pro Gly Gin Gin Gin Asn Gly Phe Thr Gly Gin 

245 250 255 

Pro Ala Thr Tyr His His Asn Ser Thr Thr Thr Trp Thr Gly Ser Arg 

260 265 270 

Thr Ala Pro Tyr Thr Pro Asn Leu Pro His His Gin Asn Gly His Leu 

275 280 285 

Gin His His Pro Pro Met Pro Pro His Pro Gly His Tyr Trp Pro Val 

290 295 300 

His Asn Glu Leu Ala Phe Gin Pro Pro lie Ser Asn His Pro Ala Pro 
305 310 315 320 

Glu Tyr Trp Cys Ser He Ala Tyr Phe Glu Met Asp Val Gin Val Gly 

325 330 335 

Glu Thr Phe Lys Val Pro Ser Ser Cys Pro He Val Thr Val Asp Gly 

340 345 350 

Tyr Val Asp Pro Ser Gly Gly Asp Arg Phe Cys Leu Gly Gin Leu Ser 

355 360 365 

Asn Val His Arg Thr Glu Ala He Glu Arg Ala Arg Leu His He Gly 



//?- 



370 375 380 

Lys Gly Val Gin Leu Glu Cys Lys Gly Glu Gly Asp Val Trp Val Arg 
385 390 395 400 

Gys Leu Ser Asp His Ala Val Phe Val Gin Ser Tyr Tyr Leu Asp Arg 

405 410 415 

Glu Ala Gly Arg Ala Pro Gly Asp Ala Val His Lys He Tyr Pro Ser 

420 425 430 

Ala Tyr He Lys Val Phe Asp Leu Arg Gin Cys His Arg Gin Met Gin 

435 440 445 

Gin Gin Ala Ala Thr Ala Gin Ala Ala Ala Ala Ala Gin Ala Ala Ala 

450 455 460 

Val Ala Gly Asn He Pro Gly Pro Gly Ser Val Gly Gly He Ala Pro 
465 470 475 480 

Ala He Ser Leu Ser Ala Ala Ala Gly He Gly Val Asp Asp Leu Arg 

485 490 495 

Arg Leu Cys He Leu Arg Met Ser Phe Val Lys Gly Trp Gly Pro Asp 

500 505 510 

Tyr Pro Arg Gin Ser He Lys Glu Thr Pro Cys Trp He Glu He His 

515 520 525 

Leu His Arg Ala Leu Gin Leu Leu Asp Glu Val Leu His Thr Met Pro 

530 535 540 

He Ala Asp Pro Gin Pro Leu Asp Trp Asp Pro Pro Val Ala Thr Met 
545 550 555 560 

Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val 

565 570 575 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 

580 585 590 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys 

595 600 605 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 

610 615 620 

Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
625 630 635 640 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 

645 650 655 

Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 

660 665 670 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 

675 680 685 

Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 

690 695 700 

Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
705 710 715 720 

lie Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 

725 730 735 

Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 

740 745 750 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 

755 760 765 

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 

770 775 780 

Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
785 790 795 

(2) INFORMATION FOR SEQ ID NO: 78: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3138 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDMA 
(ix) FEATURE: 

(A) NAME /KEY: Coding Sequence 

(B) LOCATION: 1 . . .3135 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

ATG GCG GGC TGG ATC CAG GCC CAG CAG CTG CAG GGA GAC GCG CTG CGC 48 
Met Ala Gly Trp lie Gin Ala Gin Gin Leu Gin Gly Asp Ala Leu Arg 
15 10 15 

CAG ATG CAG GTG CTG TAC GGC CAG CAC TTC CCC ATC GAG GTC CGG CAC 96 
Gin Met Gin Val Leu Tyr Gly Gin His Phe Pro lie Glu Val Arg His 
20 25 30 

TAC TTG GCC CAG TGG ATT GAG AGO CAG CCA TGG GAT GCC ATT GAC TTG 144 
Tyr Leu Ala Gin Trp lie Glu Ser Gin Pro Trp Asp Ala lie Asp Leu 
35 40 45 

GAC AAT CCC CAG GAC AGA GCC CAA GCC ACC CAG CTC CTG GAG GGC CTG 192 
Asp Asn Pro Gin Asp Arg Ala Gin Ala Thr Gin Leu Leu Glu Gly Leu 
50 55 60 

GTG CAG GAG CTC CAG AAG AAG GCG GAG CAC CAG GTG GGG GAA GAT GGG 2 40 

Val Gin Glu Leu Gin Lys Lys Ala Glu His Gin Val Gly Glu Asp Gly 
65 70 75 80 

TTT TTA CTG AAG ATC AAG CTG GGG CAC TAC GCC ACG CAG CTC CAG AAA 2 88 

Phe Leu Leu Lys lie Lys Leu Gly His Tyr Ala Thr Gin Leu Gin Lys 
85 90 95 

ACA TAT GAC CGC TGC CCC CTG GAG CTG GTC CGC TGC ATC CGG CAC ATT 336 
Thr Tyr Asp Arg Cys Pro Leu Glu Leu Val Arg Cys lie Arg His lie 
100 105 110 

CTG TAC AAT GAA CAG AGG CTG GTC CGA GAA GCC AAC AAT TGC AGC TCT 3 84 

Leu Tyr Asn Glu Gin Arg Leu Val Arg Glu Ala Asn Asn Cys Ser Ser 
115 120 125 

CCG GCT GGG ATC CTC GTT GAC GCC ATG TCC CAG AAG CAC CTT CAG ATC 4 32 

Pro Ala Gly He Leu Val Asp Ala Met Ser Gin Lys His Leu Gin He 
130 135 140 

AAC CAG ACA TTT GAG GAG CTG CGA CTG GTC ACG CAG GAC ACA GAG AAT 4 80 

Asn Gin Thr Phe Glu Glu Leu Arg Leu Val Thr Gin Asp Thr Glu Asn 
145 150 155 160 

GAG CTG AAG AAA CTC CAG CAG ACT CAG GAG TAC TTC ATC ATC CAG TAC 52 8 

Glu Leu Lys Lys Leu Gin Gin Thr Gin Glu Tyr Phe He He Gin Tyr 
165 170 175 

CAG GAG AGC CTG AG 3 ATC CAA GCT CAG TTT GCC CAG CTG GCC CAG CTG 57 6 



Gin Glu Ser Leu Arg He Gin Ala Gin Phe Ala Gin Leu Ala Gin Leu 
180 185 190 

AGC CCC CAG GAG CGT CTG AGC CGG GAG ACG GCC CTC CAG CAG AAG CAG 624 
Ser Pro Gin Glu Arg Leu Ser Arg Glu Thr Ala Leu Gin Gin Lys Gin 
195 200 205 

GTG TCT CTG GAG GCC TGG TTG CAG CGT GAG GCA CAG ACA CTG CAG CAG 67 2 

Val Ser Leu Glu Ala Trp Leu Gin Arg Glu Ala Gin Thr Leu Gin Gin 
210 215 220 

TAC CGC GTG GAG CTG GCC GAG AAG CAC CAG AAG ACC CTG CAG CTG CTG 720 
Tyr Arg Val Glu Leu Ala Glu Lys His Gin Lys Thr Leu Gin Leu Leu 
225 230 235 240 

CGG AAG CAG CAG ACC ATC ATC CTG GAT GAC GAG CTG ATC CAG TGG AAG 768 
Arg Lys Gin Gin Thr He He Leu Asp Asp Glu Leu He Gin Trp Lys 
245 250 255 

CGG CGG CAG CAG CTG GCC GGG AAC GGC GGG CCC CCC GAG GGC AGC CTG 816 
Arg Arg Gin Gin Leu Ala Gly Asn Gly Gly Pro Pro Glu Gly Ser Leu 
260 265 270 

GAC GTG CTA CAG TCC TGG TGT GAG AAG TTG GCC GAG ATC ATC TGG CAG 864 
Asp Val Leu Gin Ser Trp Cys Glu Lys Leu Ala Glu He He Trp Gin 
275 280 285 

AAC CGG CAG CAG ATC CGC AGG GCT GAG CAC CTC TGC CAG CAG CTG CCC 912 
Asn Arg Gin Gin He Arg /Arg Ala Glu His Leu Cys Gin Gin Leu Pro 
290 295 300 

ATC CCC GGC CCA GTG GAG GAG ATG CTG GCC GAG GTC AAC GCC ACC ATC 960 
lie Pro Gly Pro Val Glu Glu Met Leu Ala Glu Val Asn Ala Thr He 
305 310 315 320 

ACG GAC ATT ATC TCA GCC CTG GTG ACC AGC ACA TTC ATC ATT GAG AAG 1008 
Thr Asp He He Ser Ala Leu Val Thr Ser Thr Phe He He Glu Lys 
325 330 335 

CAG CCT CCT CAG GTC CTG AAG ACC CAG ACC AAG TTT GCA GCC ACC GTA 1056 
Gin Pro Pro Gin Val Leu Lys Thr Gin Thr Lys Phe Ala Ala Thr Val 
340 345 350 

CGC CTG CTG GTG GGC GGG AAG CTG AAC GTG CAC ATG AAT CCC CCC CAG 1104 
Arg Leu Leu Val Gly Gly Lys Leu Asn Val His Met Asn Pro Pro Gin 
355 360 365 

GTG AAG GCC ACC ATC ATC AGT GAG CAG CAG GCC AAG TCT CTG CTT AAA 1152 
Val Lys Ala Thr He He Ser Glu Gin Gin Ala Lys Ser Leu Leu Lys 
370 375 380 

AAT GAG AAC ACC CGC AAC GAG TGC AGT GGT GAG ATC CTG AAC AAC TGC 1200 
Asn Glu Asn Thr Arg Asn Glu Cys Ser Gly Glu He Leu Asn Asn Cys 
385 390 395 400 

TGC GTG ATG GAG TAC CAC CAA GCC ACG GGC ACC CTC AGT GCC CAC TTC 1248 
Cys Val Met Glu Tyr His Gin Ala Thr Gly Thr Leu Ser Ala His Phe 
405 410 415 



AGG AAC ATG TCA CTG AAG AGG ATC AAG CGT GCT GAC CGG CGG GGT GCA 1296 

Arg Asn Met Ser Leu Lys Arg lie Lys Arg Ala Asp Arg Arg Gly Ala 
420 425 430 

GAG TCC GTG ACA GAG GAG AAG TTC ACA GTC CTG TTT GAG TCT CAG TTC 1344 

Glu Ser Val Thr Glu Glu Lys Phe Thr Val Leu Phe Glu Ser Gin Phe 
435 440 445 

AGT GTT GGC AGC AAT GAG CTT GTG TTC CAG GTG AAG ACT CTG TCC CTA 13 92 

Ser Val Gly Ser Asn Glu Leu Val Phe Gin Val Lys Thr Leu Ser Leu 
450 455 460 

CCT GTG GTT GTC ATC GTC CAC GGC AGC CAG GAC CAC AAT GCC ACG GCT 1440 

Pro Val Val Val He Val His Gly Ser Gin Asp His Asn Ala Thr Ala 
465 470 475 480 

ACT GTG CTG TGG GAC AAT GCC TTT GCT GAG CCG GGC AGG GTG CCA TTT 1488 

Thr Val Leu Trp Asp Asn Ala Phe Ala Glu Pro Gly Arg Val Pro Phe 

485 490 495 

GCC GTG CCT GAC AAA GTG CTG TGG CCG CAG CTG TGT GAG GCG CTC AAC 1536 

Ala Val Pro Asp Lys Val Leu Trp Pro Gin Leu Cys Glu Ala Leu Asn 
500 505 510 

ATG AAA TTC AAG GCC GAA GTG CAG AGC AAC CGG GGC CTG ACC AAG GAG 1584 

Met Lys Phe Lys Ala Glu Val Gin Ser Asn Arg Gly Leu Thr Lys Glu 
515 520 525 

AAC CTC GTG TTC CTG GCG CAG AAA CTG TTC AAC AAC AGC AGC AGC CAC 1632 

Asn Leu Val Phe Leu Ala Gin Lys Leu Phe Asn Asn Ser Ser Ser His 
530 535 540 

CTG GAG GAC TAC AGT GGC CTG TCC GTG TCC TGG TCC CAG TTC AAC AGG 1680 

Leu Glu Asp Tyr Ser Gly Leu Ser Val Ser Trp Ser Gin Phe Asn Arg 
545 550 555 560 

GAG AAC TTG CCG GGC TGG AAC TAC ACC TTC TGG CAG TGG TTT GAC GGG 1728 

Glu Asn Leu Pro Gly Trp Asn Tyr Thr Phe Trp Gin Trp Phe Asp Gly 

565 570 575 

GTG ATG GAG GTG TTG AAG AAG CAC CAC AAG CCC CAC TGG AAT GAT GGG 177 6 

Val Met Glu Val Leu Lys Lys His His Lys Pro His Trp Asn Asp Gly 
580 585 590 

GCC ATC CTA GGT TTT GTG AAT AAG CAA CAG GCC CAC GAC CTG CTC ATC 1824 

Ala He Leu Gly Phe Val Asn Lys Gin Gin Ala His Asp Leu Leu He 
595 600 605 

AAC AAG CCC GAC GGG ACC TTC TTG TTG CGC TTT AGT GAC TCA GAA ATC 1872 

Asn Lys Pro Asp Gly Thr Phe Leu Leu Arg Phe Ser Asp Ser Glu He 
610 615 620 

GGG GGC ATC ACC ATC GCC TGG AAG TTT GAC TCC CCG GAA CGC AAC CTG 1920 

Gly Gly He Thr He Ala Trp Lys Phe Asp Ser Pro Glu Arg Asn Leu 

625 630 635 640 

TGG AAC CTG AAA CCA TTC ACC ACG CGG GAT TTC TCC ATC AGG TCC CTG 1968 



Trp Asn Leu Lys Pro Phe Thr Thr Arg Asp Phe Ser lie Arq Ser Leu 
645 650 655 

GCT GAC CGG CTG GGG GAC CTG AGC TAT CTC ATC TAT GTG TTT CCT GAC 2016 
Ala Asp Arg Leu Gly Asp Leu Ser Tyr Leu lie Tyr Val Phe Pro Asp 
660 665 670 

CGC CCC AAG GAT GAG GTC TTC TCC AAG TAC TAC ACT CCT GTG CTG GCT 2064 
Arg Pro Lys Asp Glu Val Phe Ser Lys Tyr Tyr Thr Pro Val Leu Ala 
675 680 685 

AAA GCT GTT GAT GGA TAT GTG AAA CCA CAG ATC AAG CAA GTG GTC CCT 2112 
Lys Ala Val Asp Gly Tyr Val Lys Pro Gin lie Lys Gin Val Val Pro 
690 695 700 

GAG TTT GTG AAT GCA TCT GGA GAT GCT GGG GGC AGC AGC GCC ACG TAC 2160 
Glu Phe Val Asn Ala Ser Ala Asp Ala Gly Gly Ser Ser Ala Thr Tyr 
705 710 715 720 

ATG GAC CAG GCC CCC TCC CCA GCT GTG TGC CCC CAG GCT CCC TAT AAC 22 08 
Met Asp Gin Ala Pro Ser Pro Ala Val Cys Pro Gin Ala Pro Tyr Asn 
725 730 735 

ATG TAC CCA CAG AAC CCT GAC CAT GTA CTC GAT CAG GAT GGA GAA TTC 2256 
Met Tyr Pro Gin Asn Pro Asp His Val Leu Asp Gin Asp Gly Glu Phe 
740 745 750 

GAC CTG GAT GAG ACC ATG GAT GTG GCC AGG CAC GTG GAG GAA CTC TTA 23 04 
Asp Leu Asp Glu Thr Met Asp Val Ala Arg His Val Glu Glu Leu Leu 
755 760 765 

CGC CGA CCA ATG GAC AGT CTT GAC TCC CGC CTC TCG CCC CCT GCC GGT 23 52 
Arg Arg Pro Met Asp Ser Leu Asp Ser Arg Leu Ser Pro Pro Ala Gly 

770 775 780 

CTT TTC ACC TCT GCC AGA GGC TCC CTC TCA TGG GTA CCG CGG GCC CGG 2400 
Leu Phe Thr Ser Ala Arg Gly Ser Leu Ser Trp Val Pro Arg Ala Arg 
785 790 795 800 

GAT CCA CCG GTC GCC ACC ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC 2448 
Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr 
805 810 815 

GGG GTG GTG CCC ATC CTG GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC 2496 
Gly Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His 
820 825 830 

AAG TTC AGC GTG TCC GGC GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG 2544 
Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys 
835 840 845 

CTG ACC CTG AAG TTC ATC TOZ ACC ACC GGC AAG CTG CCC GTG CCC TGG 2592 
Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp 
850 855 860 

CCC ACC CTC GTG ACC ACC CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC 2 640 

Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg 
865 870 875 880 



TAC CCC GAC CAC ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC 2 6 88 
Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro 
885 890 895 

GAA GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC 2736 
Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn 
900 905 910 

TAC AAG ACC CGC GCC GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC 27 84 
Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn 
915 920 925 

CGC ATC GAG CTG AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG 2 832 
Arg He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu 
930 935 940 

GGG CAC AAG CTG GAG TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG 2880 
Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met 
945 950 955 960 

GCC GAC AAG CAG AAG AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC 2928 
Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His 
965 970 975 

AAC ATC GAG GAC GGC AGC GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC 2976 
Asn lie Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn 
980 985 990 

ACC CCC ATC GGC GAC GGC CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG 3 024 
Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu 
995 1000 1005 

AGC ACC CAG TCC GCC CTG AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC 3072 
Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His 
1010 1015 1020 

ATG GTC CTG CTG GAG TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG 3120 
Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met 
1025 1030 1035 1040 

GAC GAG CTG TAC AAG TAA 3138 
Asp Glu Leu Tyr Lys 
1045 



(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1045 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE : internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 



Met Ala Gly Trp 
1 

Gin Met Gin Val 
20 

Tyr Leu Ala Gin 
35 

Asp Asn Pro Gin 
50 

Val Gin Glu Leu 
65 

Phe Leu Leu Lys 

Thr Tyr Asp Arg 
100 

Leu Tyr Asn Glu 
115 

Pro Ala Gly lie 
130 

Asn Gin Thr Phe 
145 

Glu Leu Lys Lys 

Gin Glu Ser Leu 
180 

Ser Pro Gin Glu 
195 

Val Ser Leu Glu 
210 

Tyr Arg Val Glu 
225 

Arg Lys Gin Gin 

Arg Arg Gin Gin 
260 

Asp Val Leu Gin 
275 

Asn Arg Gin Gin 
290 

lie Pro Gly Pro 
305 

Thr Asp lie lie 

Gin Pro Pro Gin 
340 

Arg Leu Leu Val 
355 

Val Lys Ala Thr 
370 

Asn Glu Asn Thr 
385 

Cys Val Met Glu 

Arg Asn Met Ser 
420 

Glu Ser Val Thr 
435 

Ser Val Gly Ser 



lie Gin Ala Gin 
5 

Leu Tyr Gly Gin 

Trp He Glu Ser 
40 

Asp Arg Ala Gin 
55 

Gin Lys Lys Ala 
70 

He Lys Leu Gly 
85 

Cys Pro Leu Glu 

Gin Arg Leu Val 
120 

Leu Val Asp Ala 
135 

Glu Glu Leu Arg 
150 

Leu Gin Gin Thr 
165 

Arg He Gin Ala 

Arg Leu Ser Arg 
200 

Ala Trp Leu Gin 
215 

Leu Ala Glu Lys 
230 

Thr He He Leu 
245 

Leu Ala Gly Asn 

Ser Trp Cys Glu 
280 

He Arg Arg Ala 
295 

Val Glu Glu Met 
310 

Ser Ala Leu Val 
325 

Val Leu Lys Thr 

Gly Gly Lys Leu 
360 

He He Ser Glu 
375 

Arg Asn Glu Cys 
390 

Tyr His Gin Ala 
405 

Leu Lys Arg He 

Glu Glu Lys Phe 
440 

Asn Glu Leu Val 



Gin Leu Gin Gly 
10 

His Phe Pro He 
25 

Gin Pro Trp Asp 

Ala Thr Gin Leu 
60 

Glu His Gin Val 

75 

His Tyr Ala Thr 

90 

Leu Val Arg Cys 
105 

Arg Glu Ala Asn 

Met Ser Gin Lys 
140 

Leu Val Thr Gin 
155 

Gin Glu Tyr Phe 
170 

Gin Phe Ala Gin 
185 

Glu Thr Ala Leu 

Arg Glu Ala Gin 
220 

His Gin Lys Thr 
235 

Asp A.sp Glu Leu 
250 

Gly Gly Pro Pro 
265 

Lys Leu Ala Glu 

Glu His Leu Cys 
300 

Leu Ala Glu Val 
315 

Thr Ser Thr Phe 
330 

Gin Thr Lys Phe 
345 

Asn Val His Met 

Gin Gin Ala Lys 
380 

Ser Gly Glu He 
395 

Thr Gly Thr Leu 
410 

Lys Arg Ala Asp 
425 

Thr Val Leu Phe 
Phe Gin Val Lys 



Asp Ala Leu Arg 
15 

Glu Val Arg His 
30 

Ala He Asp Leu 
45 

Leu Glu Gly Leu 

Gly Glu Asp Gly 
80 

Gin Leu Gin Lys 
95 

He Arg His He 
110 

Asn Cys Ser Ser 
125 

His Leu Gin He 

Asp Thr Glu Asn 
160 

He He Gin Tyr 
175 

Leu Ala Gin Leu 
190 

Gin Gin Lys Gin 
205 

Thr Leu Gin Gin 

Leu Gin Leu Leu 
240 

lie Gin Trp Lys 
255 

Glu Gly Ser Leu 
270 

He He Trp Gin 
285 

Gin Gin Leu Pro 

Asn Ala Thr He 
320 

He He Glu Lys 
335 

Ala Ala Thr Val 
350 

Asn Pro Pro Gin 
365 

Ser Leu Leu Lys 

Leu Asn Asn Cys 
400 

Ser Ala His Phe 
415 

Arg Arg Gly Ala 
430 

Glu Ser Gin Phe 
445 

Thr Leu Ser Leu 



450 455 460 

Pro Val Val Val He Val His Gly Ser Gin Asp His Asn Ala Thr Ala 
465 470 475 480 

Thr Val Leu Trp Asp Asn Ala Phe Ala Glu Pro Gly Arg Val Pro Phe 

485 490 495 

Ala Val Pro Asp Lys Val Leu Trp Pro Gin Leu Cys Glu Ala Leu Asn 

500 505 510 

Met Lys Phe Lys Ala Glu Val Gin Ser Asn Arg Gly Leu Thr Lys Glu 

515 520 525 

Asn Leu Val Phe Leu Ala Gin Lys Leu Phe Asn Asn Ser Ser Ser His 

530 535 540 

Leu Glu Asp Tyr Ser Gly Leu Ser Val Ser Trp Ser Gin Phe Asn Arg 
545 550 555 560 

Glu Asn Leu Pro Gly Trp Asn Tyr Thr Phe Trp Gin Trp Phe Asp Gly 

565 570 575 

Val Met Glu Val Leu Lys Lys His His Lys Pro His Trp Asn Asp Gly 

580 585 590 

Ala He Leu Gly Phe Val Asn Lys Gin Gin Ala His Asp Leu Leu He 

595 600 605 

Asn Lys Pro Asp Gly Thr Phe Leu Leu Arg Phe Ser Asp Ser Glu He 

610 615 620 

Gly Gly lie Thr He Ala Trp Lys Phe Asp Ser Pro Glu Arg Asn Leu 
625 630 635 640 

Trp Asn Leu Lys Pro Phe Thr Thr Arg Asp Phe Ser lie Arg Ser Leu 

645 650 655 

Ala Asp Arg Leu Gly Asp Leu Ser Tyr Leu He Tyr Val Phe Pro Asp 

660 665 670 

Arg Pro Lys Asp Glu Val Phe Ser Lys Tyr Tyr Thr Pro Val Leu Ala 

675 680 685 

Lys Ala Val Asp Gly Tyr Val Lys Pro Gin He Lys Gin Val Val Pro 

690 695 700 

Glu Phe Val Asn Ala Ser Ala Asp Ala Gly Gly Ser Ser Ala Thr Tyr 
705 710 715 720 

Met Asp Gin Ala Pro Ser Pro Ala Val Cys Pro Gin Ala Pro Tyr Asn 

725 730 735 

Met Tyr Pro Gin Asn Pro Asp His Val Leu Asp Gin Asp Gly Glu Phe 

740 745 750 

Asp Leu Asp Glu Thr Met Asp Val Ala Arg His Val Glu Glu Leu Leu 

755 760 765 

Arg Arg Pro Met Asp Ser Leu Asp Ser Arg Leu Ser Pro Pro Ala Gly 

770 775 780 

Leu Phe Thr Ser Ala Arg Gly Ser Leu Ser Trp Val Pro Arg Ala Arg 
785 790 795 800 

Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr 

805 810 815 

Gly Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His 

820 825 830 

Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys 

835 840 845 

Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp 

850 855 860 

Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg 
865 870 875 880 

Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro 

885 890 895 

Glu Gly Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn 

900 905 910 

Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn 



915 920 925 

Arg lie Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu 

930 935 940 

Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met 
945 950 955 960 

Ala Asp Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His 

965 970 975 

Asn lie Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn 

980 985 990 

Thr Pro lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu 

995 1000 1005 

Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His 

1010 1015 1020 

Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr Leu Gly Met 
025 1030 1035 1040 

Asp Glu Leu Tyr Lys 
1045 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
TGGGATCCTC AGGCCGTGCT GCTGGCCG 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
GTCTCGAGGG AGCATGGGCA CCTTGCG 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
TGGGATCCGA GAAG TCTATA TCCCATC 

(2) INFORMATION FOR SEQ ID NO: 83: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 28 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

TGGGATCCTT AGAAGTCTAT ATCCCATC 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 28 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
<D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
GTCTCGAGCC ATGAACGCCC CCGAGCGG 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 
GTGAATTCTC GTC TGATTTC TGGCAGGAGG 

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
GTG AATTCTT TACGTCTGAT TTCTGGCAGG 

(2) INFORMATION FOR SEQ ID NO : 87 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
GTCTCGAGCC ATGGACGAAC TGTTCCCCCT CATC 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88 
GTGGATCCAA GGAGCTGATC TGACTCAGCA G 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89 
GTGGATCCTT AGGAGCTGAT CTGACTCAGC AG 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90 
CCTCCTAAGC TTATCATGGA CCATTATGAT TC 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91 
CCTCCTGGAT CCCTGCGCAG GATGATGGTC CAG 



(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 
GGATGGAAGC TTCAATGGCT GCCATCCGGA AGAAACTGGT GATTG 
(2) INFORMATION FOR SEQ ID NO: 93: 

( i ) SEQUENCE CHARACTER I ST ICS : 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 93: 
GGATGGGGAT CCTCACAAGA CAAGGCAACC AGATTTTTTC TTCCC 
(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
GGGAAGCTTC CATGAGCGAG ACGGTCATC 

(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
CCCGGATCCT CAGGGAGAAC CCCGCTTC 

(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



( D) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
GTGAATTCGA CCATGGAGCG GCCCCCGGGG 

(2) INFORMATION FOR SEQ ID NO : 97 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
GTGGTACCCA TTCTGTTAAC CAACTCC 

(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
GTGGTACCTC ATTCTGTTAA CCAACTCC 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
GTCTCGAGAG ATGCTGTCCC GTGGGTGG 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100 
GTGAATTCGC TTCCTCTTGA GGGAACC 
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(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101 
GTGAATTCAC TTCCTCTTGA GGGAACC 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ing 1 e 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102 
GTCTCGAGCC ATGGAGAACT TCCAAAAGG 

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103 
GTGGATCCCA GAGTCGAAGA TGGGGTAC 

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : s i ng 1 e 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104 
GTGGATCCTC AGAGTCGAAG ATGGGGTAC 

(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
GTGAATTCGG CGATGCCAGA CCCCGCGGCG 

(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
GTGGATCCCA GGCACAGGCA GCCTCAGCCT TC 

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
GTGGATCCTC AGGCACAGGC AGCCTCAGCC TTC 

(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2616 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
fix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 1. . .2613 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

ATG GTG AGO AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 



GTC GAG CTG GAC GGC GAC GTA AAC 
Val Glu Leu Asp Gly Asp Val Asn 
20 



GGC CAC AAG TTC AGO GTG TCC GGC 
Gly His Lys Phe Ser Val Ser Gly 
25 30 
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GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 240 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 2 88 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 336 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 HO 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 480 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 57 6 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 
180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 720 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT CGA GCT CAA GCT TCG AAT TCG GCG ATG CCA GAC CCC 768 
Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Ala Met Pro Asp Pro 
245 250 255 

GCG GCG CAC CTG CCC TTC TTC TAC GGC AGC ATC TCG CGT GCC GAG GCC 816 
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Ala Ala His Leu Pro Phe Phe Tyr Gly Ser lie Ser Arg Ala Glu Ala 
260 265 270 

GAG GAG CAC CTG AAG CTG GCG GGC ATG GCG GAC GGG CTC TTC CTG CTG 864 
Glu Glu His Leu Lys Leu Ala Gly Met Ala Asp Gly Leu Phe Leu Leu 
275 280 285 

CGC CAG TGC CTG CGC TCG CTG GGC GGC TAT GTG CTG TCG CTC GTG CAC 912 
Arg Gin Cys Leu Arg Ser Leu Gly Gly Tyr Val Leu Ser Leu Val His 
290 295 300 

GAT GTG CGC TTC CAC CAC TTT CCC ATC GAG CGC CAG CTC AAC GGC ACC 960 
Asp Val Arg Phe His His Phe Pro He Glu Arg Gin Leu Asn Gly Thr 
305 310 315 320 

TAC GCC ATT GCC GGC GGC AAA GCG CAC TGT GGA CCG GCA GAG CTC TGC 1008 
Tyr Ala He Ala Gly Gly Lys Ala His Cys Gly Pro Ala Glu Leu Cys 
325 330 335 

GAG TTC TAC TCG CGC GAC CCC GAC GGG CTG CCC TGC AAC CTG CGC AAG 1056 
Glu Phe Tyr Ser Arg Asp Pro Asp Gly Leu Pro Cys Asn Leu Arg Lys 
340 345 350 

CCG TGC AAC CGG CCG TCG GGC CTC GAG CCG CAG CCG GGG GTC TTC GAC 1104 
Pro Cys Asn Arg Pro Ser Gly Leu Glu Pro Gin Pro Gly Val Phe Asp 
355 360 365 

TGC CTG CGA GAC GCC ATG GTG CGT GAC TAC GTG CGC CAG ACG TGG AAG 1152 
Cys Leu Arg Asp Ala Met Val Arg Asp Tyr Val Arg Gin Thr Trp Lys 

370 375 380 

CTG GAG GGC GAG GCC CTG GAG CAG GCC ATC ATC AGC CAG GCC CCG CAG 1200 
Leu Glu Gly Glu Ala Leu Glu Gin Ala He He Ser Gin Ala Pro Gin 
385 390 395 400 

GTG GAG AAG CTC ATT GCT ACG ACG GCC CAC GAG CGG ATG CCC TGG TAC 12 48 

Val Glu Lys Leu He Ala Thr Thr Ala His Glu Arg Met Pro Trp Tyr 
405 410 415 

CAC AGC AGC CTG ACG CGT GAG GAG GCC GAG CGC AAA CTT TAC TCT GGG 1296 
His Ser Ser Leu Thr Arg Glu Glu Ala Glu Arg Lys Leu Tyr Ser Gly 
420 425 430 

GCG CAG ACC GAC GGC AAG TTC CTG CTG AGG CCG CGG AAG GAG CAG GGC 13 44 

Ala Gin Thr Asp Gly Lys Phe Leu Leu Arg Pro Arg Lys Glu Gin Gly 
435 440 445 

AC A TAC GCC CTG TCC CTC ATC TAT GGG AAG ACG GTG TAC CAC TAC CTC 13 92 

Thr Tyr Ala Leu Ser Leu He Tyr Gly Lys Thr Val Tyr His Tyr Leu 
450 455 460 

ATC AGC CAA GAC AAG GCG GGC AAG TAC TGC ATT CCC GAG GGC ACC AAG 1440 
He Ser Gin Asp Lys Ala Gly Lys Tyr Cys He Pro Glu Gly Thr Lys 
465 470 475 480 

TTT GAC ACG CTC TGG CAG CTG GTG GAG TAT CTG AAG CTG AAG GCG GAC 14 88 

Phe Asp Thr Leu Trp Gin Leu Val Glu Tyr Leu Lys Leu Lys Ala Asp 
485 490 495 
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G3G CTC ATC TAC TGC CTG AAG GAG GCC TGC CCC AAC AGC AGT GCC AGC 1536 
Gly Leu lie Tyr Cys Leu Lys Glu Ala Cys Pro Asn Ser Ser Ala Ser 
500 505 510 

AAC GCC TCA GGG GCT GCT GCT CCC ACA CTC CCA GCC CAC CCA TCC ACG 1584 
Asn Ala Ser Gly Ala Ala Ala Pro Thr Leu Pro Ala His Pro Ser Thr 
515 520 525 

TTG ACT CAT CCT CAG AGA CGA ATC GAC ACC CTC AAC TCA GAT GGA TAC 1632 
Leu Thr His Pro Gin Arg Arg He Asp Thr Leu Asn Ser Asp Gly Tyr 
530 535 540 

ACC CCT GAG CCA GCA CGC ATA ACG TCC CCA GAC AAA CCG CGG CCG ATG 1680 
Thr Pro Glu Pro Ala Arg He Thr Ser Pro Asp Lys Pro Arg Pro Met 
545 550 555 560 

CCC ATG GAC ACG AGC GTG TAT GAG AGC CCC TAC AGC GAC CCA GAG GAG 17 28 
Pro Met Asp Thr Ser Val Tyr Glu Ser Pro Tyr Ser Asp Pro Glu Glu 
565 570 575 

CTC AAG GAC AAG AAG CTC TTC CTG AAG CGC GAT AAC CTC CTC ATA GCT 1776 
Leu Lys Asp Lys Lys Leu Phe Leu Lys Arg Asp Asn Leu Leu He Ala 
580 585 590 

GAC ATT GAA CTT GGC TGC GGC AAC TTT GGC TCA GTG CGC CAG GGC GTG 1824 
Asp He Glu Leu Gly Cys Gly Asn Phe Gly Ser Val Arg Gin Gly Val 
595 600 605 

TAC CGC ATG CGC AAG AAG CAG ATC GAC GTG GCC ATC AAG GTG CTG AAG 1872 
Tyr Arg Met Arg Lys Lys Gin He Asp Val Ala He Lys Val Leu Lys 
610 615 620 

CAG GGC ACG GAG AAG GCA GAC ACG GAA GAG ATG ATG CGC GAG GCG CAG 1920 
Gin Gly Thr Glu Lys Ala Asp Thr Glu Glu Met Met Arg Glu Ala Gin 
625 630 635 640 

ATC ATG CAC CAG CTG GAC AAC CCC TAC ATC GTG CGG CTC ATT GGC GTC 1968 
lie Met His Gin Leu Asp Asn Pro Tyr He Val Arg Leu He Gly Val 
645 650 655 

TGC CAG GCC GAG GCC CTC ATG CTG GTC ATG GAG ATG GCT GGG GGC GGG 2016 
Cys Gin Ala Glu Ala Leu Met Leu Val Met Glu Met Ala Gly Gly Gly 
660 665 670 

CCG CTG CAC AAG TTC CTG GTC GGC AAG AGG GAG GAG ATC CCT GTG AGC 2064 
Pro Leu His Lys Phe Leu Val Gly Lys Arg Glu Glu He Pro Val Ser 
675 680 685 

AAT GTG GCC GAG CTG CTG CAC CAG GTG TCC ATG GGG ATG AAG TAC CTG 2112 
Asn Val Ala Glu Leu Leu His Gin Val Ser Met Gly Met Lys Tyr Leu 
690 695 700 

GAG GAG AAG AAC TTT GTG CAC CGT GAC CTG GCG GCC CGC AAC GTC CTG 2160 
Glu Glu Lys Asn Phe Val His Arg Asp Leu Ala Ala Arg Asn Val Leu 
705 710 715 720 

CTG GTT AAC CGG CAC TAC GCC AAG ATC AGC GAC TTT GGC CTC TCC AAA 2208 
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Leu Val Asn Arg His Tyr Ala Lys He Ser Asp Phe Gly Leu Ser Lys 
725 730 735 

GCA CTG GGT GCC GAC GAC AGC TAC TAC ACT GCC CGC TCA GCA GGG AAG 2256 
Ala Leu Gly Ala Asp Asp Ser Tyr Tyr Thr Ala Arg Ser Ala Gly Lys 
740 745 750 

TGG CCG CTC AAG TGG TAC GCA CCC GAA TGC ATC AAC TTC CGC AAG TTC 2304 
Trp Pro Leu Lys Trp Tyr Ala Pro Glu Cys He Asn Phe Arg Lys Phe 
755 760 765 

TCC AGC CGC AGC GAT GTC TGG AGC TAT GGG GTC ACC ATG TGG GAG GCC 2352 
Ser Ser Arg Ser Asp Val Trp Ser Tyr Gly Val Thr Met Trp Glu Ala 
770 775 780 

TTG TCC TAC GGC CAG AAG CCC TAC AAG AAG ATG AAA GGG CCG GAG GTC 2400 
Leu Ser Tyr Gly Gin Lys Pro Tyr Lys Lys Met Lys Gly Pro Glu Val 
785 790 795 800 

ATG GCC TTC ATC GAG CAG GGC AAG CGG ATG GAG TGC CCA CCA GAG TGT 2448 
Met Ala Phe He Glu Gin Gly Lys Arg Met Glu Cys Pro Pro Glu Cys 
805 810 815 

CCA CCC GAA CTG TAC GCA CTC ATG AGT GAC TGC TGG ATC TAC AAG TGG 2496 
Pro Pro Glu Leu Tyr Ala Leu Met Ser Asp Cys Trp He Tyr Lys Trp 
820 825 830 

GAG GAT CGC CCC GAC TTC CTG ACC GTG GAG CAG CGC ATG CGA GCC TGT 2 544 
Glu Asp Arg Pro Asp Phe Leu Thr Val Glu Gin Arg Met Arg Ala Cys 
835 840 845 

TAC TAC AGC CTG GCC AGC AAG GTG GAA GGG CCC CCA GGC AGC AC A CAG 2 592 
Tyr Tyr Ser Leu Ala Ser Lys Val Glu Gly Pro Pro Gly Ser Thr Gin 
850 855 860 

AAG GCT GAG G2T GCC TGT GCC TGA 2 616 

Lys Ala Glu Ala Ala Cys Ala 
865 870 



(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 871 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Fhe Ser Val Ser Gly 

2C 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
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35 

Cys Thr Thr Gly 
50 

Leu Thr Tyr Gly 
65 

Gin His Asp Phe 

Arg Thr lie Phe 
100 

Val Lys Phe Glu 
115 

lie Asp Phe Lys 
130 

Asn Tyr Asn Ser 
145 

Gly lie Lys Veil 

Val Gin Leu Ala 
180 

Pro Val Leu Leu 
195 

Ser Lys Asp Pro 
210 

Val Thr Ala Ala 
225 

Gly Leu Arg Ser 

Ala Ala His Leu 
260 

Glu Glu His Leu 
275 

Arg Gin Cys Leu 
290 

Asp Val Arg Phe 
305 

Tyr Ala He Ala 

Glu Phe Tyr Ser 
340 

Pro Cys Asn Arg 
355 

Cys Leu Arg Asp 
370 

Leu Glu Gly Glu 
385 

Val Glu Lys Leu 

His Ser Ser Leu 
420 

Ala Gin Thr Asp 
435 

Thr Tyr Ala Leu 
450 

He Ser Gin Asp 
465 

Phe Asp Thr Leu 
Gly Leu He Tyr 



40 

Lys Leu Pro Val 
55 

Val Gin Cys Phe 

70 

Phe Lys Ser Ala 
85 

Phe Lys Asp Asp 

Gly Asp Thr Leu 
120 

Glu Asp Gly Asn 
135 

His Asn Val Tyr 
150 

Asn Phe Lys He 

165 

Asp His Tyr Gin 

Pro Asp Asn His 
200 

Asn Glu Lys Arg 
215 

Gly lie Thr Leu 
230 

Arg Ala Gin Ala 
245 

Pro Phe Phe Tyr 

Lys Leu Ala Gly 
280 

Arg Ser Leu Gly 
295 

His His Phe Pro 
310 

Gly Gly Lys Ala 
325 

Arg Asp Pro Asp 

Pro Ser Gly Leu 
360 

Ala Met Val Arg 
375 

Ala Leu Glu Gin 
390 

He Ala Thr Thr 

405 

Thr Arg Glu Glu 

Gly Lys Phe Leu 
440 

Ser Leu He Tyr 
455 

Lys Ala Gly Lys 
470 

Ti-p Gin Leu Val 
4&S 

Cys Leu Lys Glu 



Pro Trp Pro Thr 
60 

Ser Arg Tyr Pro 
75 

Met Pro Glu Gly 

90 

Gly Asn Tyr Lys 
105 

Val Asn Arg He 

He Leu Gly His 
140 

He Met Ala Asp 
155 

Arg His Asn He 
170 

Gin Asn Thr Pro 
185 

Tyr Leu Ser Thr 

Asp His Met Val 

220 

Gly Met Asp Glu 
235 

Ser Asn Ser Ala 
250 

Gly Ser He Ser 
265 

Met Ala Asp Gly 

Gly Tyr Val Leu 
300 

He Glu Arg Gin 
315 

His Cys Gly Pro 
330 

Gly Leu Pro Cys 
345 

Glu Pro Gin Pro 

Asp Tyr Val Arg 

380 

Ala He He Ser 
395 

Ala His Glu Arg 
410 

Ala Glu Arg Lys 
425 

Leu Arg Pro Arg 

Gly Lys Thr Val 
460 

Tyr Cys He Pro 
475 

Glu Tyr Leu Lys 
490 

Ala Cys Pro Asn 



45 

Leu Val Thr Thr 

Asp His Met Lys 
80 

Tyr Val Gin Glu 
95 

Thr Arg Ala Glu 
110 

Glu Leu Lys Gly 
125 

Lys Leu Glu Tyr 

Lys Gin Lys Asn 
160 

Glu Asp Gly Ser 
175 

He Gly Asp Gly 
190 

Gin Ser Ala Leu 

205 

Leu Leu Glu Phe 

Leu Tyr Lys Ser 
240 

Met Pro Asp Pro 
255 

Arg Ala Glu Ala 
270 

Leu Phe Leu Leu 
285 

Ser Leu Val His 

Leu Asn Gly Thr 
320 

Ala Glu Leu Cys 
335 

Asn Leu Arg Lys 
350 

Gly Val Phe Asp 
365 

Gin Thr Trp Lys 

Gin Ala Pro Gin 
400 

Met Pro Trp Tyr 
415 

Leu Tyr Ser Gly 
430 

Lys Glu Gin Gly 
445 

Tyr His Tyr Leu 

Glu Gly Thr Lys 
480 

Leu Lys Ala Asp 
495 

Ser Ser Ala Ser 
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500 505 510 

Asn Ala Ser Gly Ala Ala Ala Pro Thr Leu Pro Ala His Pro Ser Thr 

515 520 525 

Leu Thr His Pro Gin Arg Arg lie Asp Thr Leu Asn Ser Asp Gly Tyr 

530 535 540 

Thr Pro Glu Pro Ala Arg lie Thr Ser Pro Asp Lys Pro Arg Pro Met 
545 550 555 560 

Pro Met Asp Thr Ser Val Tyr Glu Ser Pro Tyr Ser Asp Pro Glu Glu 

565 570 575 

Leu Lys Asp Lys Lys Leu Phe Leu Lys Arg Asp Asn Leu Leu lie Ala 

580 585 590 

Asp lie Glu Leu Gly Cys Gly Asn Phe Gly Ser Val Arg Gin Gly Val 

595 600 605 

Tyr Arg Met Arg Lys Lys Gin He Asp Val Ala He Lys Val Leu Lys 

610 615 620 

Gin Gly Thr Glu Lys Ala Asp Thr Glu Glu Met Met Arg Glu Ala Gin 
625 630 635 640 

He Met His Gin Leu Asp Asn Pro Tyr He Val Arg Leu He Gly Val 

645 650 655 

Cys Gin Ala Glu Ala Leu Met Leu Val Met Glu Met Ala Gly Gly Gly 

660 665 670 

Pro Leu His Lys Phe Leu Val Gly Lys Arg Glu Glu He Pro Val Ser 

675 680 685 

Asn Val Ala Glu Leu Leu His Gin Val Ser Met Gly Met Lys Tyr Leu 

690 695 700 

Glu Glu Lys Asn Phe Val His Arg Asp Leu Ala Ala Arg Asn Val Leu 
705 710 715 720 

Leu Val Asn Arg His Tyr Ala Lys He Ser Asp Phe Gly Leu Ser Lys 

725 730 735 

Ala Leu Gly Ala Asp Asp Ser Tyr Tyr Thr Ala Arg Ser Ala Gly Lys 

740 745 750 

Trp Pro Leu Lys Trp Tyr Ala Pro Glu Cys He Asn Phe Arg Lys Phe 

755 760 765 

Ser Ser Arg Ser Asp Val Trp Ser Tyr Gly Val Thr Met Trp Glu Ala 

770 775 780 

Leu Ser Tyr Gly Gin Lys Pro Tyr Lys Lys Met Lys Gly Pro Glu Val 
785 790 795 800 

Met Ala Phe He Glu Gin Gly Lys Arg Met Glu Cys Pro Pro Glu Cys 

805 810 815 

Pro Pro Glu Leu Tyr Ala Leu Met Ser Asp Cys Trp He Tyr Lys Trp 

820 825 830 

Glu Asp Arg Pro Asp Phe Leu Thr Val Glu Gin Arg Met Arg Ala Cys 

835 840 845 

Tyr Tyr Ser Leu Ala Ser Lys Val Glu Gly Pro Pro Gly Ser Thr Gin 

850 855 860 

Lys Ala Glu Ala Ala Cys Ala 
865 870 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 2598 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



( i i ) MOLECULE TYPE : cDNA 
(ix) FEATURE: 



/3S 



(A) NAME /KEY: Coding Sequence 

(B) LOCATION: 1. . .2595 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:110: 

ATG CCA GAC CCC GCG GCG CAC CTG CCC TTC TTC TAC GGC AGC ATC TCG 4 8 

Met Pro Asp Pro Ala Ala His Leu Pro Phe Phe Tyr Gly Ser He Ser 
15 10 15 

CGT GCC GAG GCC GAG GAG CAC CTG AAG CTG GCG GGC ATG GCG GAC GGG 96 
Arg Ala Glu Ala Glu Glu His Leu Lys Leu Ala Gly Met Ala Asp Gly 
20 25 30 

CTC TTC CTG CTG CGC CAG TGC CTG CGC TCG CTG GGC GGC TAT GTG CTG 144 
Leu Phe Leu Leu Arg Gin Cys Leu Arg Ser Leu Gly Gly Tyr Val Leu 
35 40 45 

TCG CTC GTG CAC GAT GTG CGC TTC CAC CAC TTT CCC ATC GAG CGC CAG 192 
Ser Leu Val His Asp Val Arg Phe His His Phe Pro He Glu Arg Gin 
50 55 60 

CTC AAC GGC ACC TAC GCC ATT GCC GGC GGC AAA GCG CAC TGT GGA CCG 240 
Leu Asn Gly Thr Tyr Ala He Ala Gly Gly Lys Ala His Cys Gly Pro 
65 70 75 80 

GCA GAG CTC TGC GAG TTC TAC TCG CGC GAC CCC GAC GGG CTG CCC TGC 288 
Ala Glu Leu Cys Glu Phe Tyr Ser Arg Asp Pro Asp Gly Leu Pro Cys 
85 90 95 

AAC CTG CGC AAG CCG TGC AAC CGG CCG TCG GGC CTC GAG CCG CAG CCG 33 6 

Asn Leu Arg Lys Pro Cys Asn Arg Pro Ser Gly Leu Glu Pro Gin Pro 
100 105 110 

GGG GTC TTC GAC TGC CTG CGA GAC GCC ATG GTG CGT GAC TAC GTG CGC 3 84 

Gly Val Phe Asp Cys Leu Arg Asp Ala Met Val Arg Asp Tyr Val Arg 
115 120 125 

CAG ACG TGG AAG CTG GAG GGC GAG GCC CTG GAG CAG GCC ATC ATC AGC 432 
Gin Thr Trp Lys Leu Glu Gly Glu Ala Leu Glu Gin Ala He He Ser 
130 135 140 

CAG GCC CCG CAG GTG GAG AAG CTC ATT GCT ACG ACG GCC CAC GAG CGG 480 
Gin Ala Pro Gin Val Glu Lys Leu He Ala Thr Thr Ala His Glu Arg 
145 150 155 160 

ATG CCC TGG TAC CAC AGC AGC CTG ACG CGT GAG GAG GCC GAG CGC AAA 528 
Met Pro Trp Tyr His Ser Ser Leu Thr Arg Glu Glu Ala Glu Arg Lys 
165 170 175 

CTT TAC TCT GGG GCG CAG ACC GAC GGC AAG TTC CTG CTG AGG CCG CGG 576 
Leu Tyr Ser Gly Ala Gin Thr Asp Gly Lys Phe Leu Leu Arg Pro Arg 
180 185 190 

AAG GAG CAG GGC ACA TAC GCC CTG TCC CTC ATC TAT GGG AAG ACG GTG 624 
Lys Glu Gin Gly Thr Tyr Ala Leu Ser Leu He Tyr Gly Lys Thr Val 
195 200 205 



/ 3 ? 



TAC CAC TAC CTC ATC AGC CAA GAC AAG GCG GGC AAG TAC TGC ATT CCC 67 2 

Tyr His Tyr Leu He Ser Gin Asp Lys Ala Gly Lys Tyr Cys He Pro 
210 215 220 

GAG GGC ACC AAG TTT GAC ACG CTC TGG CAG CTG GTG GAG TAT CTG AAG 720 
Glu Gly Thr Lys Phe Asp Thr Leu Trp Gin Leu Val Glu Tyr Leu Lys 
225 230 235 240 

CTG AAG GCG GAC GGG CTC ATC TAC TGC CTG AAG GAG GCC TGC CCC AAC 768 
Leu Lys Ala Asp Gly Leu He Tyr Cys Leu Lys Glu Ala Cys Pro Asn 
245 250 255 

AGC AGT GCC AGC AAC GCC TCA GGG GCT GCT GCT CCC ACA CTC CCA GCC 816 
Ser Ser Ala Ser Asn Ala Ser Gly Ala Ala Ala Pro Thr Leu Pro Ala 
260 265 270 

CAC CCA TCC ACG TTG ACT CAT CCT CAG AGA CGA ATC GAC ACC CTC AAC 864 
His Pro Ser Thr Leu Thr His Pro Gin Arg Arg He Asp Thr Leu Asn 
275 280 285 

TCA GAT GGA TAC ACC CCT GAG CCA GCA CGC ATA ACG TCC CCA GAC AAA 912 
Ser Asp Gly Tyr Thr Pro Glu Pro Ala Arg He Thr Ser Pro Asp Lys 
290 295 300 

CCG CGG CCG ATG CCC ATG GAC ACG AGC GTG TAT GAG AGC CCC TAC AGC 960 
Pro Arg Pro Met Pro Met Asp Thr Ser Val Tyr Glu Ser Pro Tyr Ser 
305 310 315 320 

GAC CCA GAG GAG CTC AAG GAC AAG AAG CTC TTC CTG AAG CGC GAT AAC 10 OS 

Asp Pro Glu Glu Leu Lys Asp Lys Lys Leu Phe Leu Lys Arg Asp Asn 
325 330 335 

CTC CTC ATA GCT GAC ATT GAA CTT GGC TGC GGC AAC TTT GGC TCA GTG 1056 
Leu Leu He Ala Asp He Glu Leu Gly Cys Gly Asn Phe Gly Ser Val 
340 345 350 

CGC CAG GGC GTG TAC CGC ATG CGC AAG AAG CAG ATC GAC GTG GCC ATC 1104 
Arg Gin Gly Val Tyr Arg Met Arg Lys Lys Gin He Asp Val Ala He 
355 360 365 

AAG GTG CTG AAG CAG GGC ACG GAG AAG GCA GAC ACG GAA GAG ATG ATG 1152 
Lys Val Leu Lys Gin Gly Thr Glu Lys Ala Asp Thr Glu Glu Met Met 
370 375 380 

CGC GAG GCG CAG ATC ATG CAC CAG CTG GAC AAC CCC TAC ATC GTG CGG 12 00 

Arg Glu Ala Gin He Met His Gin Leu Asp Asn Pro Tyr He Val Arg 
385 390 395 400 

CTC ATT GGC GTC TGC CAG GCC GAG GCC CTC ATG CT3 GTC ATG GAG ATG 12 4 8 

Leu He Gly Val Cys Gin Ala Glu Ala Leu Met Leu Val Met Glu Met 
405 410 415 

GCT GGG GGC GGG CCG CTC CAC AAG TTC CTG GTC GG2 AAG AGG GAG GAG 12 96 
Ala Gly Gly Gly Pro Leu His Lys Phe Leu Val Gly Lys Arg Glu Glu 
420 425 430 

ATC CCT GTG AGC AAT GTG GCC GAG CTG CTG CAC CAG GTG TCC ATG GGG 13 44 



lie Pro Val Ser Asn Val Ala Glu Leu Leu His Gin Val Ser Met Gly 
435 440 445 

ATG AAG TAC CTG GAG GAG AAG AAC TTT GTG CAC CGT GAC CTG GCG GCC 13 92 
Met Lys Tyr Leu Glu Glu Lys Asn Phe Val His Arg Asp Leu Ala Ala 
450 455 460 

CGC AAC GTC CTG CTG GTT AAC CGG CAC TAC GCC AAG ATC AGC GAC TTT 1440 
Arg Asn Val Leu Leu Val Asn Arg His Tyr Ala Lys He Ser Asp Phe 
465 470 475 480 

GGC CTC TCC AAA GCA CTG GGT GCC GAC GAC AGC TAC TAC ACT GCC CGC 1488 
Gly Leu Ser Lys Ala Leu Gly Ala Asp Asp Ser Tyr Tyr Thr Ala Arg 
485 490 495 

TCA GCA GGG AAG TGG CCG CTC AAG TGG TAC GCA CCC GAA TGC ATC AAC 1536 
Ser Ala Gly Lys Trp Pro Leu Lys Trp Tyr Ala Pro Glu Cys He Asn 
500 505 510 

TTC CGC AAG TTC TCC AGC CGC AGC GAT GTC TGG AGC TAT GGG GTC ACC 1584 
Phe Arg Lys Phe Ser Ser Arg Ser Asp Val Trp Ser Tyr Gly Val Thr 
515 520 525 

ATG TGG GAG GCC TTG TCC TAC GGC CAG AAG CCC TAC AAG AAG ATG AAA 1632 
Met Trp Glu Ala Leu Ser Tyr Gly Gin Lys Pro Tyr Lys Lys Met Lys 
530 535 540 

GGG CCG GAG GTC ATG GCC TTC ATC GAG CAG GGC AAG CGG ATG GAG TGC 1680 
Gly Pro Glu Val Met Ala Phe He Glu Gin Gly Lys Arg Met Glu Cys 
545 550 555 560 

CCA CCA GAG TGT CCA CCC GAA CTG TAC GCA CTC ATG AGT GAC TGC TGG 172 8 
Pro Pro Glu Cys Pro Pro Glu Leu Tyr Ala Leu Met Ser Asp Cys Trp 
565 570 575 

ATC TAC AAG TGG GAG GAT CGC CCC GAC TTC CTG ACC GTG GAG CAG CGC 1776 
He Tyr Lys Trp Glu Asp Arg Pro Asp Phe Leu Thr Val Glu Gin Arg 
580 585 590 

ATG CGA GCC TGT TAC TAC AGC CTG GCC AGC AAG GTG GAA GGG CCC CCA 1824 
Met Arg Ala Cys Tyr Tyr Ser Leu Ala Ser Lys Val Glu Gly Pro Pro 
595 600 605 

GGC AGC ACA CAG AAG GCT GAG GCT GCC TGT GCC TGG GAT CCA CCG GTC 1872 
Gly Ser Thr Gin Lys Ala Glu Ala Ala Cys Ala Trp Asp Pro Pro Val 
610 615 620 

GCC ACC ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC 1920 
Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro 
625 630 635 640 

ATC CTG GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG 1968 
He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val 
645 650 655 

TCC GGC GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG 2016 
Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys 
660 665 670 



TTC ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG 2 064 
Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val 
675 680 685 

ACC ACC CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC 2112 
Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His 
690 695 700 

ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC 2160 
Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val 
705 710 715 720 

CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC 2208 
Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg 
725 730 735 

GCC GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG 22 56 
Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu 
740 745 750 

AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG 2 3 04 
Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu 
755 760 765 

GAG TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG 2 3 52 
Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin 
770 775 780 

AAG AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC 24 00 
Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp 
785 790 795 800 

GGC AGC GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC 24 4 8 
Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly 
805 810 815 

GAC GGC CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC 2496 
Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser 
820 825 830 

GCC CTG AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG 2 544 
Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu 
835 840 845 

GAG TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC 2 5 92 
Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr 
850 855 860 

AAG TAA 2 59£ 

Lys 

865 



(2) INFORMATION FOR SEQ ID NO : 1 1 1 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 865 amino acids 



(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

Met Pro Asp Pro Ala Ala His Leu Pro Phe Phe Tyr Gly Ser lie Ser 

15 10 15 

Arg Ala Glu Ala Glu Glu His Leu Lys Leu Ala Gly Met Ala Asp Gly 

20 25 30 

Leu Phe Leu Leu Arg Gin Cys Leu Arg Ser Leu Gly Gly Tyr Val Leu 

35 40 45 

Ser Leu Val His Asp Val Arg Phe His His Phe Pro lie Glu Arg Gin 

50 55 60 

Leu Asn Gly Thr Tyr Ala He Ala Gly Gly Lys Ala His Cys Gly Pro 
65 70 75 80 

Ala Glu Leu Cys Glu Phe Tyr Ser Arg Asp Pro Asp Gly Leu Pro Cys 

85 90 95 

Asn Leu Arg Lys Pro Cys Asn Arg Pro Ser Gly Leu Glu Pro Gin Pro 

100 105 110 

Gly Val Phe Asp Cys Leu Arg Asp Ala Met Val Arg Asp Tyr Val Arg 

115 120 125 

Gin Thr Trp Lys Leu Glu Gly Glu Ala Leu Glu Gin Ala He He Ser 

130 135 140 

Gin Ala Pro Gin Val Glu Lys Leu He Ala Thr Thr Ala His Glu Arg 
145 150 155 160 

Met Pro Trp Tyr His Ser Ser Leu Thr Arg Glu Glu Ala Glu Arg Lys 

165 170 175 

Leu Tyr Ser Gly Ala Gin Thr Asp Gly Lys Phe Leu Leu Arg Pro Arg 

180 185 190 

Lys Glu Gin Gly Thr Tyr Ala Leu Ser Leu He Tyr Gly Lys Thr Val 

195 200 205 

Tyr His Tyr Leu He Ser Gin Asp Lys Ala Gly Lys Tyr Cys He Pro 

210 215 220 

Glu Gly Thr Lys Phe Asp Thr Leu Trp Gin Leu Val Glu Tyr Leu Lys 
225 230 235 240 

Leu Lys Ala Asp Gly Leu He Tyr Cys Leu Lys Glu Ala Cys Pro Asn 

245 250 255 

Ser Ser Ala Ser Asn Ala Ser Gly Ala Ala Ala Pro Thr Leu Pro Ala 

260 265 270 

His Pro Ser Thr Leu Thr His Pro Gin Arg Arg He Asp Thr Leu Asn 

275 280 285 

Ser Asp Gly Tyr Thr Pro Glu Pro Ala Arg He Thr Ser Pro Asp Lys 

290 295 300 

Pro Arg Pro Met Pro Met Asp Thr Ser Val Tyr Glu Ser Pro Tyr Ser 
305 310 315 320 

Asp Pro Glu Glu Leu Lys Asp Lys Lys Leu Phe Leu Lys Arg Asp Asn 

325 330 335 

Leu Leu lie Ala Asp He Glu Leu Gly Cys Gly Asn Phe Gly Ser Val 

340 345 350 

Arg Gin Gly Val Tyr Arg Met Arg Lys Lys Gin He Asp Val Ala He 

355 360 365 

Lys Val Leu Lys Gin Gly Thr Glu Lys Ala Asp Thr Glu Glu Met Met 

370 375 380 

Arg Glu Ala Gin lie Met His Gin Leu Asp Asn Pro Tyr He Val Arg 
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385 390 395 400 

Leu lie Gly Val Cys Gin Ala Glu Ala Leu Met Leu Val Met Glu Met 

405 410 415 

Ala Gly Gly Gly Pro Leu His Lys Phe Leu Val Gly Lys Arg Glu Glu 

420 425 430 

He Pro Val Ser Asn Val Ala Glu Leu Leu His Gin Val Ser Met Gly 

435 440 445 

Met Lys Tyr Leu Glu Glu Lys Asn Phe Val His Arg Asp Leu Ala Ala 

450 455 460 

Arg Asn Val Leu Leu Val Asn Arg His Tyr Ala Lys He Ser Asp Phe 
465 470 475 480 

Gly Leu Ser Lys Ala Leu Gly Ala Asp Asp Ser Tyr Tyr Thr Ala Arg 

485 490 495 

Ser Ala Gly Lys Trp Pro Leu Lys Trp Tyr Ala Pro Glu Cys He Asn 

500 505 510 

Phe Arg Lys Phe Ser Ser Arg Ser Asp Val Trp Ser Tyr Gly Val Thr 

515 520 525 

Met Trp Glu Ala Leu Ser Tyr Gly Gin Lys Pro Tyr Lys Lys Met Lys 

530 535 540 

Gly Pro Glu Val Met Ala Phe He Glu Gin Gly Lys Arg Met Glu Cys 
545 550 555 560 

Pro Pro Glu Cys Pro Pro Glu Leu Tyr Ala Leu Met Ser Asp Cys Trp 

565 570 575 

He Tyr Lys Trp Glu Asp Arg Pro Asp Phe Leu Thr Val Glu Gin Arg 

580 585 590 

Met Arg Ala Cys Tyr Tyr Ser Leu Ala Ser Lys Val Glu Gly Pro Pro 

595 600 605 

Gly Ser Thr Gin Lys Ala Glu Ala Ala Cys Ala Trp Asp Pro Pro Val 

610 615 620 

Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro 
625 630 635 640 

He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val 

645 650 655 

Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys 

660 665 670 

Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val 

675 680 685 

Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His 

690 695 700 

Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val 
705 710 715 720 

Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg 

725 730 735 

Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu 

740 745 750 

Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu 

755 760 765 

Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin 

770 775 780 

Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp 
785 790 795 800 

Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly 

805 810 815 

Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser 

820 825 830 

Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu 

835 840 845 

Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr 
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850 855 860 

Lys 
865 

(2) INFORMATION FOR SEQ ID NO: 112: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1635 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1 . . . 1632 
(D) OTHER INFORMATION: 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:112: 

ATC GAG AAC TTC CAA AAG GTG GAA AAG ATC GGA GAG GGC ACG TAC GGA 48 
Met Glu Asn Phe Gin Lys Val Glu Lys lie Gly Glu Gly Thr Tyr Gly 
15 10 15 

GTT GTG TAC AAA GCC AGA AAC AAG TTG ACG GGA GAG GTG GTG GCG CTT 96 
Val Val Tyr Lys Ala Arg Asn Lys Leu Thr Gly Glu Val Val Ala Leu 
20 25 30 

AAG AAA ATC CGC CTG GAC ACT GAG ACT GAG GGT GTG CCC AGT ACT GCC 144 
Lys Lys lie Arg Leu Asp Thr Glu Thr Glu Gly Val Pro Ser Thr Ala 
35 40 45 

ATC CGA GAG ATC TCT CTG CTT AAG GAG CTT AAC CAT CCT AAT ATT GTC 192 
lie Arg Glu lie Ser Leu Leu Lys Glu Leu Asn His Pro Asn lie Val 
50 55 60 

AAG CTG CTG GAT GTC ATT CAC ACA GAA AAT AAA CTC TAC CTG GTT TTT 240 
Lys Leu Leu Asp Val lie His Thr Glu Asn Lys Leu Tyr Leu Val Phe 
65 70 75 80 

GAA TTT CTG CAC CAA GAT CTC AAG AAA TTC ATG GAT GCC TCT GCT CTC 288 
Glu Phe Leu His Gin Asp Leu Lys Lys Phe Met Asp Ala Ser Ala Leu 
85 90 95 

ACT GGC ATT CCT CTT CCC CTC ATC AAG AGC TAT CTG TTC CAG CTG CTC 336 
Thr Gly lie Pro Leu Pro Leu lie Lys Ser Tyr Leu Phe Gin Leu Leu 
100 105 110 

CAG GGC CTA GCT TTC TGC CAT TCT CAT CGG GTC CTC CAC CGA GAC CTT 3 84 

Gin Gly Leu Ala Phe Cys His Ser His Arg Val Leu His Arg Asp Leu 
115 120 125 

AAA CCT CAG AAT CTG CTT ATT AAC ACA GAG GGG GCC ATC AAG CTA GCA 432 
Lys Pro Gin Asn Leu Leu lie Asn Thr Glu Gly Ala lie Lys Leu Ala 
130 135 140 

GAC TTT GGA CTA GCC AGA GCT TTT GGA GTC CCT GTT CGT ACT TAC ACC 4 80 
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Asp Phe Gly Leu Ala Arg Ala Phe Gly Val Pro Val Arg Thr Tyr Thr 
145 150 155 160 

CAT GAG GTG GTG ACC CTG TGG TAC CGA GCT CCT GAA ATC CTC CTG GGC 528 
His Glu Val Val Thr Leu Trp Tyr Arg Ala Pro Glu lie Leu Leu Gly 
165 170 175 

TCG AAA TAT TAT TCC ACA GCT GTG GAC ATC TGG AGC CTG GGC TGC ATC 576 
Ser Lys Tyr Tyr Ser Thr Ala Val Asp He Trp Ser Leu Gly Cys He 
180 185 190 

TTT GCT GAG ATG GTG ACT CGC CGG GCC CTG TTC CCT GGA GAT TCT GAG 624 
Phe Ala Glu Met Val Thr Arg Arg Ala Leu Phe Pro Gly Asp Ser Glu 
195 200 205 

ATT GAC CAG CTC TTC CGG ATC TTT CGG ACT CTG GGG ACC CCA GAT GAG 672 
He Asp Gin Leu Phe Arg He Phe Arg Thr Leu Gly Thr Pro Asp Glu 
210 215 220 

GTG GTG TGG CCA GGA GTT ACT TCT ATG CCT GAT TAC AAG CCA AGT TTC 720 
Val Val Trp Pro Gly Val Thr Ser Met Pro Asp Tyr Lys Pro Ser Phe 
225 230 235 240 

CCC AAG TGG GCC CGG CAA GAT TTT AGT AAA GTT GTA CCT CCC CTG GAT 7 68 

Pro Lys Trp Ala Arg Gin Asp Phe Ser Lys Val Val Pro Pro Leu Asp 
245 250 255 

GAA GAT GGA CGG AGC TTG TTA TCG CAA ATG CTG CAC TAC GAC CCT AAC 816 
Glu Asp Gly Arg Ser Leu Leu Ser Gin Met Leu His Tyr Asp Pro Asn 
260 265 270 

AAG CGG ATT TCG GCC AAG GCA GCC CTG GCT CAC CCT TTC TTC CAG GAT 864 
Lys Arg He Ser Ala Lys Ala Ala Leu Ala His Pro Phe Phe Gin Asp 
275 280 285 

GTG ACC AAG CCA GTA CCC CAT CTT CGA CTC TGG GAT CCA CCG GTC GCC 912 
Val Thr Lys Pro Val Pro His Leu Arg Leu Trp Asp Pro Pro Val Ala 
290 295 300 

ACC ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC 960 
Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He 
305 310 315 320 

CTG GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC 1008 
Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser 
325 330 335 

GGC GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC 1056 
Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe 
340 345 350 

ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC 1104 
He Cys Thr Thr Gly Lys Leu Fro Val Pro Trp Pro Thr Leu Val Thr 
355 360 365 

ACC CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG 1152 
Tnr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met 
370 375 380 



AAG CAG CAC GAC TTG TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG 12 00 
Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin 
385 390 395 400 

GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC 12 4 8 
Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala 
405 410 415 

GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG 12 96 
Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys 
420 425 430 

GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG 1344 
Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu 
435 440 445 

TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG 13 92 
Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys 
450 455 460 

AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC 1440 
Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly 
465 470 475 480 

AGC GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC 1488 
Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp 
485 490 495 

GGC CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC 153 6 
Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala 
500 505 510 

CTG AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG 1584 
Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu 
515 520 525 

TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG T 1633 
Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
530 535 540 

AA 1635 

(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 544 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 



Met Glu Asn Phe Gin Lys Val Glu Lys He Gly Glu Gly Thr Tyr Gly 
15 10 15 



Val Val Tyr Lys Ala Arg Asn Lys Leu Thr Gly Glu Val Val Ala Leu 

20 25 30 

Lys Lys He Arg Leu Asp Thr Glu Thr Glu Gly Val Pro Ser Thr Ala 

35 40 45 

He Arg Glu He Ser Leu Leu Lys Glu Leu Asn His Pro Asn He Val 

50 55 60 

Lys Leu Leu Asp Val He His Thr Glu Asn Lys Leu Tyr Leu Val Phe 
65 70 75 80 

Glu Phe Leu His Gin Asp Leu Lys Lys Phe Met Asp Ala Ser Ala Leu 

85 90 95 

Thr Gly He Pro Leu Pro Leu He Lys Ser Tyr Leu Phe Gin Leu Leu 

100 105 HO 

Gin Gly Leu Ala Phe Cys His Ser His Arg Val Leu His Arg Asp Leu 

115 120 125 

Lys Pro Gin Asn Leu Leu He Asn Thr Glu Gly Ala He Lys Leu Ala 

130 135 140 

Asp Phe Gly Leu Ala Arg Ala Phe Gly Val Pro Val Arg Thr Tyr Thr 
145 150 155 160 

His Glu Val Val Thr Leu Trp Tyr Arg Ala Pro Glu He Leu Leu Gly 

165 170 175 

Ser Lys Tyr Tyr Ser Thr Ala Val Asp He Trp Ser Leu Gly Cys lie 

180 185 190 

Phe Ala Glu Met Val Thr Arg Arg Ala Leu Phe Pro Gly Asp Ser Glu 

195 200 205 

He Asp Gin Leu Phe Arg He Phe Arg Thr Leu Gly Thr Pro Asp Glu 

210 215 220 

Val Val Trp Pro Gly Val Thr Ser Met Pro Asp Tyr Lys Pro Ser Phe 
225 230 235 240 

Pro Lys Trp Ala Arg Gin Asp Phe Ser Lys Val Val Pro Pro Leu Asp 

245 250 255 

Glu Asp Gly Arg Ser Leu Leu Ser Gin Met Leu His Tyr Asp Pro Ajsn 

260 265 270 

Lys Arg He Ser Ala Lys Ala Ala Leu Ala His Pro Phe Phe Gin Asp 

275 280 285 

Val Thr Lys Pro Val Pro His Leu Arg Leu Trp Asp Pro Pro Val Ala 

290 295 300 

Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He 
305 310 315 320 

Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser 

325 330 335 

Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe 

340 345 350 

He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr 

355 360 365 

Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met 

370 375 380 

Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin 
385 390 395 400 

Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala 

405 410 415 

Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys 

420 425 430 

Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu 

435 440 445 

Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys 

450 455 460 

Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly 
465 470 475 480 



Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp 

485 490 495 

Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala 

500 505 510 

Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu 

515 520 525 

Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
530 535 540 



(2) INFORMATION FOR SEQ ID NO: 114: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1635 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 



(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 1...1632 
(D) OTHER INFORMATION : 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 



ATG GTG AGO AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 48 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 240 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 2 88 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 3 36 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 



/ 9 ? 



ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 4 32 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 4 80 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 720 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT CGA GCC ATG GAG AAC TTC CAA AAG GTG GAA AAG ATC 768 
Gly Leu Arg Ser Arg Ala Met Glu Asn Phe Gin Lys Val Glu Lys He 
245 250 255 

GGA GAG GGC ACG TAC GGA GTT GTG TAC AAA GCC AGA AAC AAG TTG ACG 816 
Gly Glu Gly Thr Tyr Gly Val Val Tyr Lys Ala Arg Asn Lys Leu Thr 
260 265 270 

GGA GAG GTG GTG GCG CTT AAG AAA ATC CGC CTG GAC ACT GAG ACT GAG 864 
Gly Glu Val Val Ala Leu Lys Lys He Arg Leu Asp Thr Glu Thr Glu 
275 280 285 

GGT GTG CCC AGT ACT GCC ATC CGA GAG ATC TCT CTG CTT AAG GAG CTT 912 
Gly Val Pro Ser Thr Ala lie Arg Glu He Ser Leu Leu Lys Glu Leu 
290 295 300 

AAC CAT CCT AAT ATT GTC AAG CTG CTG GAT GTC ATT CAC ACA GAA AAT 9 60 

Asn His Pro Asn He Val Lys Leu Leu Asp Val lie His Thr Glu Asn 
305 310 315 320 

AAA CTC TAC CTG GTT TTT GAA TTT CTG CAC CAA GAT CTC AAG AAA TTC 1008 
Lys Leu Tyr Leu Val Phe Glu Phe Leu His Gin Asp Leu Lys Lys Phe 
325 330 335 

ATG GAT GCC TCT GCT CTC ACT G3C ATT CCT CTT CCC CTC ATC AAG AGC 1056 
Met Asp Ala Ser Ala Leu Thr Gly He Pro Leu Pro Leu He Lys Ser 
340 345 350 

TAT CTG TTC CAG CTG CTC CAG GSC CTA GCT TTC TGC CAT TCT CAT CGG 1104 
Tvr Leu Phe Gin Leu Leu Gin Gly Leu Ala Phe Cys His Ser His Arg 



355 360 365 

GTC CTC CAC CGA GAC CTT AAA CCT CAG AAT CTG CTT ATT AAC AC A GAG 1152 
Val Leu His Arg Asp Leu Lys Pro Gin Asn Leu Leu lie Asn Thr Glu 
370 375 380 

GGG GCC ATC AAG CTA GCA GAC TTT GGA CTA GCC AGA GCT TTT GGA GTC 1200 
Gly Ala lie Lys Leu Ala Asp Phe Gly Leu Ala Arg Ala Phe Gly Val 
385 390 395 400 

CCT GTT CGT ACT TAC ACC CAT GAG GTG GTG ACC CTG TGG TAC CGA GCT 1248 
Pro Val Arg Thr Tyr Thr His Glu Val Val Thr Leu Trp Tyr Arg Ala 
405 410 415 

CCT GAA ATC CTC CTG GGC TCG AAA TAT TAT TCC ACA GCT GTG GAC ATC 1296 
Pro Glu lie Leu Leu Gly Ser Lys Tyr Tyr Ser Thr Ala Val Asp lie 
420 425 430 

TGG AGC CTG GGC TGC ATC TTT GCT GAG ATG GTG ACT CGC CGG GCC CTG 13 44 
Trp Ser Leu Gly Cys lie Phe Ala Glu Met Val Thr Arg Arg Ala Leu 
435 440 445 

TTC CCT GGA GAT TCT GAG ATT GAC CAG CTC TTC CGG ATC TTT CGG ACT 1392 
Phe Pro Gly Asp Ser Glu lie Asp Gin Leu Phe Arg lie Phe Arg Thr 
450 455 460 

CTG GGG ACC CCA GAT GAG GTG GTG TGG CCA GGA GTT ACT TCT ATG CCT 1440 
Leu Gly Thr Pro Asp Glu Val Val Trp Pro Gly Val Thr Ser Met Pro 
465 470 475 480 

GAT TAC AAG CCA AGT TTC CCC AAG TGG GCC CGG CAA GAT TTT AGT AAA 1488 
Asp Tyr Lys Pro Ser Phe Pro Lys Trp Ala Arg Gin Asp Phe Ser Lys 
485 490 495 

GTT GTA CCT CCC CTG GAT GAA GAT GGA CGG AGC TTG TTA TCG CAA ATG 1536 
Val Val Pro Pro Leu Asp Glu Asp Gly Arg Ser Leu Leu Ser Gin Met 
500 505 510 

CTG CAC TAC GAC CCT AAC AAG CGG ATT TCG GCC AAG GCA GCC CTG GCT 1584 
Leu His Tyr Asp Pro Asn Lys Arg lie Ser Ala Lys Ala Ala Leu Ala 
515 520 525 

CAC CCT TTC TTC CAG GAT GTG ACC AAG CCA GTA CCC CAT CTT CGA CTC T 1633 
His Pro Phe Phe Gin Asp Val Thr Lys Pro Val Pro His Leu Arg Leu 
530 535 540 

GA 1635 

(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 544 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: interna] 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
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He 
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He 
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He 
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315 
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Pro 
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Leu 
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He 
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He 
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Ala 
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Ala 
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400 


Pro 


Val 
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Val 
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Trp 
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415 
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Trp 
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He 
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Glu 
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435 

Phe Pro Gly Asp 
450 

Leu Gly Thr Pro 
465 

Asp Tyr Lys Pro 

Val Val Pro Pro 
500 

Leu His Tyr Asp 
515 

His Pro Phe Phe 
530 



440 

Ser Glu lie Asp 
455 

Asp Glu Val Val 
470 

Ser Phe Pro Lys 
485 

Leu Asp Glu Asp 

Pro Asn Lys Arg 

520 

Gin Asp Val Thr 
535 



Gin Leu Phe Arg 
460 

Trp Pro Gly Val 
475 

Trp Ala Arg Gin 
490 

Gly Arg Ser Leu 
505 

lie Ser Ala Lys 

Lys Pro Val Pro 
540 



445 

He Phe Arg Thr 

Thr Ser Met Pro 
480 

Asp Phe Ser Lys 
495 

Leu Ser Gin Met 
510 

Ala Ala Leu Ala 

525 

His Leu Arg Leu 



(2) INFORMATION FOR SEQ ID NO: 116: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 532 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
<ix) FEATURE: 



(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 1...2529 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 



ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 48 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 240 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 2 88 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 336 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 



/5" 3 



GTG AAG TTC GAG G3C GAC ACC CTG GTG AAG CGC ATC GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 4 80 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 57 6 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 72 0 

Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT CGA GAG ATG CTG TCC CGT GGG TGG TTT CAC CGA GAC 7 68 

Gly Leu Arg Ser Arg Glu Met Leu Ser Arg Gly Trp Phe His Arg Asp 
245 250 255 

CTC AGT GGG CTG GAT GCA GAG ACC CTG CTC AAG GGC CGA GGT GTC CAC 816 
Leu Ser Gly Leu Asp Ala Glu Thr Leu Leu Lys Gly Arg Gly Val His 
260 265 270 

GGT AGC TTC CTG GCT CGG CCC AGT CGC AAG AAC CAG GGT GAC TTC TCG 864 
Gly Ser Phe Leu Ala Arg Pro Ser Arg Lys Asn Gin Gly Asp Phe Ser 
275 280 285 

CTC TCC GTC AGG GTG GGG GAT CAG GTG ACC CAT ATT CGG ATC CAG AAC 912 
Leu Ser Val Arg Val Gly Asp Gin Val Thr His He Arg He Gin Asn 
290 295 300 

TCA GGG GAT TTC TAT GAC CTG TAT GGA GGG GAG A^G TTT GCG ACT CTG 9 6C 

Ser Gly Asp Phe Tyr Asp Leu Tyr Gly Gly Glu Lys Phe Ala Thr Leu 
305 310 315 320 

ACA GAG CTG GTG GAG TAC TAC ACT CAG CAG CA3 GGT GTC CTG CAG GAC 10 08 

Thr Glu Leu Val Glu Tyr Tyr Thr Gin Gin Gin Gly Val Leu Gin Asp 
325 330 335 

CGC GAC GGC ACC ATC ATC CAC CTC AAG TAC CCG CTC AAC TGC TCC GAT 10 56 



Arg Asp Gly Thr lie lie His Leu Lys Tyr Pro Leu Asn Cys Ser Asp 
340 345 350 

CCC ACT AGT GAG AGG TGG TAG CAT GGC CAC ATG TCT GGC GGG CAG GCA 1104 

Pro Thr Ser Glu Arg Trp Tyr His Gly His Met Ser Gly Gly Gin Ala 
355 360 365 

GAG ACG CTG CTG CAG GCC AAG GGC GAG CCC TGG ACG TTT CTT GTG CGT 1152 

Glu Thr Leu Leu Gin Ala Lys Gly Glu Pro Trp Thr Phe Leu Val Arg 
370 375 380 

GAG AGC CTC AGC CAG CCT GGA GAC TTC GTG CTT TCT GTG CTC AGT GAC 1200 

Glu Ser Leu Ser Gin Pro Gly Asp Phe Val Leu Ser Val Leu Ser Asp 
385 390 395 400 

CAG CCC AAG GCT GGC CCA GGC TCC CCG CTC AGG GTC ACC CAC ATC AAG 1248 

Gin Pro Lys Ala Gly Pro Gly Ser Pro Leu Arg Val Thr His lie Lys 

405 410 415 

GTC ATG TGC GAG GGT GGA CGC TAC ACA GTG GGT GGT TTG GAG ACC TTC 1296 

Val Met Cys Glu Gly Gly Arg Tyr Thr Val Gly Gly Leu Glu Thr Phe 
420 425 430 

GAC AGC CTC ACG GAC CTG GTA GAG CAT TTC AAG AAG ACG GGG ATT GAG 1344 

Asp Ser Leu Thr Asp Leu Val Glu His Phe Lys Lys Thr Gly lie Glu 
435 440 445 

GAG GCC TCA GGC GCC TTT GTC TAC CTG CGG CAG CCG TAC TAT GCC ACG 1392 

Glu Ala Ser Gly Ala Phe Val Tyr Leu Arg Gin Pro Tyr Tyr Ala Thr 
450 455 460 

AGG GTG AAT GCG GCT GAC ATT GAG AAC CGA GTG TTG GAA CTG AAC AAG 1440 

Arg Val Asn Ala Ala Asp lie Glu Asn Arg Val Leu Glu Leu Asn Lys 
465 470 475 480 

AAG CAG GAG TCC GAG GAT ACA GCC AAG GCT GGC TTC TGG GAG GAG TTT 14 88 

Lys Gin Glu Ser Glu Asp Thr Ala Lys Ala Gly Phe Trp Glu Glu Phe 

485 490 495 

GAG AGT TTG CAG AAG CAG GAG GTG AAG AAC TTG CAC CAG CGT CTG GAA 1536 

Glu Ser Leu Gin Lys Gin Glu Val Lys Asn Leu His Gin Arg Leu Glu 
500 505 510 

GGG CAG CGG CCA GAG AAC AAG GGC AAG AAC CGC TAC AAG AAC ATT CTC 1584 

Gly Gin Arg Pro Glu Asn Lys Gly Lys Asn Arg Tyr Lys Asn lie Leu 
515 520 525 

CCC TTT GAC CAC AGC CGA GTG ATC CTG CAG GGA CGG GAC AGT AAC ATC 1632 

Pro Phe Asp His Ser Arg Val He Leu Gin Gly Arg Asp Ser Asn He 
530 535 540 

CCC GGG TCC GAC TAC ATC AAT GCC AAC TAC ATC AAG AAC CAG CTG CTA 1680 

Pro Gly Ser Asp Tyr He Asn Ala Asn Tyr He Lys Asn Gin Leu Leu 
545 550 555 560 

GGC CCT GAT GAG AAC GCT AAG ACC TAC ATC GCC AGC CAG GGC TGT CTG 172 8 

Gly Pro Asp Glu Asn Ala Lys Thr Tyr lie Ala Ser Gin Gly Cys Leu 

565 570 575 
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GAG GCC ACG GTC AAT GAC TTC TGG CAG ATG GCG TGG CAG GAG AAC AGC 17 7 6 

Glu Ala Thr Val Asn Asp Phe Trp Gin Met Ala Trp Gin Glu Asn Ser 
580 585 590 

CGT GTC ATC GTC ATG ACC ACC CGA GAG GTG GAG AAA GGC CGG AAC AAA 1824 
Arg Val lie Val Met Thr Thr Arg Glu Val Glu Lys Gly Arg Asn Lys 
595 600 605 

TGC GTC CCA TAC TGG CCC GAG GTG GGC ATG CAG CGT GCT TAT GGG CCC 1872 
Cys Val Pro Tyr Trp Pro Glu Val Gly Met Gin Arg Ala Tyr Gly Pro 
610 615 620 

TAC TCT GTG ACC AAC TGC GGG GAG CAT GAC ACA ACC GAA TAC AAA CTC 1920 
Tyr Ser Val Thr Asn Cys Gly Glu His Asp Thr Thr Glu Tyr Lys Leu 
625 630 635 640 

CGT ACC TTA CAG GTC TCC CCG CTG GAC AAT GGA GAC CTG ATT CGG GAG 1968 
Arg Thr Leu Gin Val Ser Pro Leu Asp Asn Gly Asp Leu lie Arg Glu 
645 650 655 

ATC TGG CAT TAC CAG TAC CTG AGC TGG CCC GAC CAT GGG GTC CCC AGT 2016 
He Trp His Tyr Gin Tyr Leu Ser Trp Pro Asp His Gly Val Pro Ser 
660 665 670 

GAG CCT GGG GGT GTC CTC AGC TTC CTG GAC CAG ATC AAC CAG CGG CAG 2064 
Glu Pro Gly Gly Val Leu Ser Phe Leu Asp Gin He Asn Gin Arg Gin 
675 680 685 

GAA AGT CTG CCT CAC GCA GGG CCC ATC ATC GTG CAC TGC AGC GCC GGC 2112 
Glu Ser Leu Pro His Ala Gly Pro He He Val His Cys Ser Ala Gly 
690 695 700 

ATC GGC CGC ACA GGC ACC ATC ATT GTC ATC GAC ATG CTC ATG GAG AAC 2160 
He Gly Arg Thr Gly Thr He He Val He Asp Met Leu Met Glu Asn 
705 710 715 720 

ATC TCC ACC AAG GGC CTG GAC TGT GAC ATT GAC ATC CAG AAG ACC ATC 22 08 
He Ser Thr Lys Gly Leu Asp Cys Asp He Asp He Gin Lys Thr He 
725 730 735 

CAG ATG GTG CGG GCG CAG CGC TCG GGC ATG GTG CAG ACG GAG GCG CAG 2256 
Gin Met Val Arg Ala Gin Arg Ser Gly Met Val Gin Thr Glu Ala Gin 
740 745 750 

TAC AAG TTC ATC TAC GTG GCC ATC GCC CAG TTC ATT GAA ACC ACT AAG 23 04 
Tyr Lys Phe He Tyr Val Ala He Ala Gin Phe He Glu Thr Thr Lys 
755 760 765 

AAG AAG CTG GAG GTC CTG CAG TCG CAG AAG GGC CAG GAG TCG GAG TAC 2 3 52 
Lys Lys Leu Glu Val Leu Gin Ser Gin Lys Gly Gin Glu Ser Glu Tyr 
770 775 780 

GGG AAC ATC ACC TAT CCC CCA OZC ATG AAG AAT GCC CAT GCC AAG GCC 2 400 
Gly Asn He Thr Tyr Pro Pro Ala Met Lys Asn Ala His Ala Lys Ala 
785 790 795 800 

TCC CGC ACC TCG TCC AAA CAC AAG GAG GAT GTG TAT GAG AAC CTG CAC 2448 
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Ser Arg Thr Ser Ser Lys His Lys Glu Asp Val Tyr Glu Asn Leu His 
805 810 815 

ACT AAG AAC AAG AGG GAG GAG AAA GTG AAG AAG CAG CGG TCA GCA GAC 2496 

Thr Lys Asn Lys Arg Glu Glu Lys Val Lys Lys Gin Arg Ser Ala Asp 
820 825 830 

AAG GAG AAG AGC AAG GGT TCC CTC AAG AGG AAG TGA 2 53 2 

Lys Glu Lys Ser Lys Gly Ser Leu Lys Arg Lys 

835 840 



(2) INFORMATION FOR SEQ ID NO: 117: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNES S : s i ng 1 e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn lie Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Arg Glu Met Leu Ser Arg Gly Trp Phe His Arg Asp 

245 250 255 

Leu Ser Gly Leu Asp Ala Glu Thr Leu Leu Lys Gly Arg Gly Val His 



260 265 270 

Gly Ser Phe Leu Ala Arg Pro Ser Arg Lys Asn Gin Gly Asp Phe Ser 

275 280 285 

Leu Ser Val Arg Val Gly Asp Gin Val Thr His He Arg He Gin Asn 

290 295 300 

Ser Gly Asp Phe Tyr Asp Leu Tyr Gly Gly Glu Lys Phe Ala Thr Leu 
305 310 315 320 

Thr Glu Leu Val Glu Tyr Tyr Thr Gin Gin Gin Gly Val Leu Gin Asp 

325 330 335 

Arg Asp Gly Thr He He His Leu Lys Tyr Pro Leu Asn Cys Ser Asp 

340 345 350 

Pro Thr Ser Glu Arg Trp Tyr His Gly His Met Ser Gly Gly Gin Ala 

355 360 365 

Glu Thr Leu Leu Gin Ala Lys Gly Glu Pro Trp Thr Phe Leu Val Arg 

370 375 380 

Glu Ser Leu Ser Gin Pro Gly Asp Phe Val Leu Ser Val Leu Ser Asp 
385 390 395 400 

Gin Pro Lys Ala Gly Pro Gly Ser Pro Leu Arg Val Thr His He Lys 

405 410 415 

Val Met Cys Glu Gly Gly Arg Tyr Thr Val Gly Gly Leu Glu Thr Phe 

420 425 430 

Asp Ser Leu Thr Asp Leu Val Glu His Phe Lys Lys Thr Gly He Glu 

435 440 445 

Glu Ala Ser Gly Ala Phe Val Tyr Leu Arg Gin Pro Tyr Tyr Ala Thr 

450 455 460 

Arg Val Asn Ala Ala Asp He Glu Asn Arg Val Leu Glu Leu Asn Lys 
465 470 475 480 

Lys Gin Glu Ser Glu Asp Thr Ala Lys Ala Gly Phe Trp Glu Glu Phe 

485 490 495 

Glu Ser Leu Gin Lys Gin Glu Val Lys Asn Leu His Gin Arg Leu Glu 

500 505 510 

Gly Gin Arg Pro Glu Asn Lys Gly Lys Asn Arg Tyr Lys Asn He Leu 

515 520 525 

Pro Phe Asp His Ser Arg Val He Leu Gin Gly Arg Asp Ser Asn lie 

530 535 540 

Pro Gly Ser Asp Tyr He Asn Ala Asn Tyr He Lys Asn Gin Leu Leu 
545 550 555 560 

Gly Pro Asp Glu Asn Ala Lys Thr Tyr He Ala Ser Gin Gly Cys Leu 

565 570 575 

Glu Ala Thr Val Asn Asp Phe Trp Gin Met Ala Trp Gin Glu Asn Ser 

580 585 590 

Arg Val He Val Met Thr Thr Arg Glu Val Glu Lys Gly Arg Asn Lys 

595 600 605 

Cys Val Pro Tyr Trp Pro Glu Val Gly Met Gin Arg Ala Tyr Gly Pro 

610 615 620 

Tyr Ser Val Thr Asn Cys Gly Glu His Asp Thr Thr Glu Tyr Lys Leu 
625 630 635 640 

Arg Thr Leu Gin Val Ser Pro Leu Asp Asn Gly Asp Leu He Arg Glu 

645 650 655 

He Trp His Tyr Gin Tyr Leu Ser Trp Pro Asp His Gly Val Pro Ser 

660 665 670 

Glu Pro Gly Gly Val Leu Ser Phe Leu Asp Gin lie Asn Gin Arg Gin 

675 680 685 

Glu Ser Leu Pro His Ala Gly Pro He He Val His Cys Ser Ala Gly 

690 695 700 

He Gly Arg Thr Gly Thr lie lie Val He Asp Met Leu Met Glu Asn 
705 710 715 720 

lie Ser Thr Lys Gly Leu Asp Cys Asp lie Asp He Gin Lys Thr lie 



725 730 735 

Gin Met Val Arg Ala Gin Arg Ser Gly Met Val Gin Thr Glu Ala Gin 

740 745 750 

Tyr Lys Phe He Tyr Val Ala He Ala Gin Phe He Glu Thr Thr Lys 

755 760 765 

Lys Lys Leu Glu Val Leu Gin Ser Gin Lys Gly Gin Glu Ser Glu Tyr 

770 775 780 

Gly Asn He Thr Tyr Pro Pro Ala Met Lys Asn Ala His Ala Lys Ala 
785 790 795 800 

Ser Arg Thr Ser Ser Lys His Lys Glu Asp Val Tyr Glu Asn Leu His 

805 810 815 

Thr Lys Asn Lys Arg Glu Glu Lys Val Lys Lys Gin Arg Ser Ala Asp 

820 825 830 

Lys Glu Lys Ser Lys Gly Ser Leu Lys Arg Lys 
835 840 



{2) INFORMATION FOR SEQ ID NO: 118: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2562 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 



(A) NAME /KEY : Coding Sequence 
(13) LOCATION: 1 . . .2559 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

ATG CTG TCC CGT GGG TGG TTT CAC CGA GAC CTC AGT GGG CTG GAT GCA 48 
Met Leu Ser Arg Gly Trp Phe His Arg Asp Leu Ser Gly Leu Asp Ala 
15 10 15 

GAG ACC CTG CTC AAG GGC CGA GGT GTC CAC GGT AGC TTC CTG GOT CGG 96 
Glu Thr Leu Leu Lys Gly Arg Gly Val His Gly Ser Phe Leu Ala Arg 
20 25 30 

CCC AGT CGC AAG AAC CAG GGT GAC TTC TCG CTC TCC GTC AGG GTG GGG 144 
Pro Ser Arg Lys Asn Gin Gly Asp Phe Ser Leu Ser Val Arg Val Gly 
35 40 45 

GAT CAG GTG ACC CAT ATT CGG ATC CAG AAC TCA GGG GAT TTC TAT GAC 192 
Asp Gin Val Thr His He Arg He Gin Asn Ser Gly Asp Phe Tyr Asp 
50 55 60 

CTG TAT GGA GGG GAG AAG TTT GCG ACT CTG ACA GAG CTG GTG GAG TAC 240 
Leu Tyr Gly Gly Glu Lys Phe Ala Thr Leu Thr Glu Leu Val Glu Tyr 
65 70 75 80 

TAC ACT CAG CAG CAG GGT GTC CTG CAG GAC CGC GAC GGC ACC ATC ATC 2 88 

Tyr Thr Gin Gin Gin Gly Val Leu Gin Asp Arg Asp Gly Thr He He 
85 90 95 

CAC CTC AAG TAC CCG CTG AAC TGC TCC GAT CCC ACT AGT GAG AGG TGG 3 36 



His Leu Lys Tyr Pro Leu Asn Cys Ser Asp Pro Thr Ser Glu Arg Trp 
100 105 110 

TAC CAT GGC CAC ATG TCT GGC GGG CAG GCA GAG ACG CTG CTG CAG GCC 3 84 

Tyr His Gly His Met Ser Gly Gly Gin Ala Glu Thr Leu Leu Gin Ala 
115 120 125 

AAG GGC GAG CCC TGG ACG TTT CTT GTG CGT GAG AGC CTC AGC CAG CCT 4 32 

Lys Gly Glu Pro Trp Thr Phe Leu Val Arg Glu Ser Leu Ser Gin Pro 
130 135 140 

GGA GAC TTC GTG CTT TCT GTG CTC AGT GAC CAG CCC AAG GCT GGC CCA 480 
Gly Asp Phe Val Leu Ser Val Leu Ser Asp Gin Pro Lys Ala Gly Pro 
145 150 155 160 

GGC TCC CCG CTC AGG GTC ACC CAC ATC AAG GTC ATG TGC GAG GGT GGA 528 
Gly Ser Pro Leu Arg Val Thr His He Lys Val Met Cys Glu Gly Gly 
165 170 175 

CGC TAC ACA GTG GGT GGT TTG GAG ACC TTC GAC AGC CTC ACG GAC CTG 57 6 

Arg Tyr Thr Val Gly Gly Leu Glu Thx Phe Asp Ser Leu Thr Asp Leu 
180 185 190 

GTA GAG CAT TTC AAG AAG ACG GGG ATT GAG GAG GCC TCA GGC GCC TTT 624 
Val Glu His Phe Lys Lys Thx Gly He Glu Glu Ala Ser Gly Ala Phe 
195 200 205 

GTC TAC CTG CGG CAG CCG TAC TAT GCC ACG AGG GTG AAT GCG GCT GAC 672 
Val Tyr Leu Arg Gin Pro Tyr Tyr Ala Thr Arg Val Asn Ala Ala Asp 
210 215 220 

ATT GAG AAC CGA GTG TTG GAA CTG AAC AAG AAG CAG GAG TCC GAG GAT 720 
He Glu Asn Arg Val Leu Glu Leu Asn Lys Lys Gin Glu Ser Glu Asp 
225 " 230 235 240 

ACA GCC AAG GCT GGC TTC TGG GAG GAG TTT GAG AGT TTG CAG AAG CAG 7 68 

Thr Ala Lys Ala Gly Phe Trp Glu Glu Phe Glu Ser Leu Gin Lys Gin 
245 250 255 

GAG GTG AAG AAC TTG CAC CAG CGT CTG GAA GGG CAG CGG CCA GAG AAC 816 
Glu Val Lys Asn Leu His Gin Arg Leu Glu Gly Gin Arg Pro Glu Asn 
260 265 270 

AAG GGC AAG AAC CGC TAC AAG AAC ATT CTC CCC TTT GAC CAC AGC CGA 864 
Lys Gly Lys Asn Arg Tyr Lys Asn He Leu Pro Phe Asp His Ser Arg 
275 280 285 

GTG ATC CTG CAG GGA CGG GAC AGT AAC ATC CCC GGG TCC GAC TAC ATC 912 
Val He Leu Gin Gly Arg Asp Ser Asn He Pro Gly Ser Asp Tyr He 
290 295 300 

AAT GCC AAC TAC ATC AAG AAC CAG CTG CTA GGC CCT GAT GAG AAC GCT 96C 
Asn Ala Asn Tyr He Lys Asn Gin Leu Leu Gly Pro Asp Glu Asn Ala 
305 310 315 320 

AAG ACC TAC ATC GCC AGC CAG GGC TGT CTG GAG GCC ACG GTC AAT GAC 10 0S 

Lys Thr Tyr He Ala Ser Gin Gly Cys Leu Glu Ala Thr Val Asn Asp 
325 330 335 



TTC TGG CAG ATG GCG TGG CAG GAG AAC AGC CGT GTC ATC GTC ATG ACC 10 56 

Phe Trp Gin Met Ala Trp Gin Glu Asn Ser Arg Val lie Val Met Thr 
340 345 350 

ACC CGA GAG GTG GAG AAA GGC CGG AAC AAA TGC GTC CCA TAC TGG CCC 1104 
Thr Arg Glu Val Glu Lys Gly Arg Asn Lys Cys Val Pro Tyr Trp Pro 
355 360 365 

GAG GTG GGC ATG CAG CGT GCT TAT GGG CCC TAC TCT GTG ACC AAC TGC 1152 
Glu Val Gly Met Gin Arg Ala Tyr Gly Pro Tyr Ser Val Thr Asn Cys 
370 375 380 

GGG GAG CAT GAC ACA ACC GAA TAC AAA CTC CGT ACC TTA CAG GTC TCC 12 00 
Gly Glu His Asp Thr Thr Glu Tyr Lys Leu Arg Thr Leu Gin Val Ser 
385 390 395 400 

CCG CTG GAC AAT GGA GAC CTG ATT CGG GAG ATC TGG CAT TAC CAG TAC 1248 
Pro Leu Asp Asn Gly Asp Leu lie Arg Glu He Trp His Tyr Gin Tyr 
405 410 415 

CTG AGC TGG CCC GAC CAT GGG GTC CCC AGT GAG CCT GGG GGT GTC CTC 12 96 
Leu Ser Trp Pro Asp His Gly Val Pro Ser Glu Pro Gly Gly Val Leu 
420 425 430 

AGC TTC CTG GAC CAG ATC AAC CAG CGG CAG GAA AGT CTG CCT CAC GCA 13 44 

Ser Phe Leu Asp Gin He Asn Gin Arg Gin Glu Ser Leu Pro His Ala 
435 440 445 

GGG CCC ATC ATC GTG CAC TGC AGC GCC GGC ATC GGC CGC ACA GGC ACC 13 92 
Gly Pro He He Val His Cys Ser Ala Gly He Gly Arg Thr Gly Thr 
450 455 460 

ATC ATT GTC ATC GAC ATG CTC ATG GAG AAC ATC TCC ACC AAG GGC CTG 1440 
He He Val lie Asp Met Leu Met Glu Asn He Ser Thr Lys Gly Leu 
465 470 475 480 

GAC TGT GAC ATT GAC ATC CAG AAG ACC ATC CAG ATG GTG CGG GCG CAG 14 88 

Asp Cys Asp He Asp He Gin Lys Thr He Gin Met Val Arg Ala Gin 
485 490 495 

CGC TCG GGC ATG GTG CAG ACG GAG GCG CAG TAC AAG TTC ATC TAC GTG 153 6 
Arg Ser Gly Met Val Gin Thr Glu Ala Gin Tyr Lys Phe He Tyr Val 
500 505 510 

GCC ATC GCC CAG TTC ATT GAA ACC ACT AAG AAG AAG CTG GAG GTC CTG 1584 
Ala He Ala Gin Phe He Glu Thr Thr Lys Lys Lys Leu Glu Val Leu 
515 520 525 

CAG TCG CAG AAG GGC CAG GAG TCG GAG TAC GGG AAC ATC ACC TAT CCC 16 32 

Gin Ser Gin Lys Gly Gin Glu Ser Glu Tyr Gly Asn He Thr Tyr Pro 
530 535 540 

CCA GCC ATG AAG AAT GCC CAT GCC AAG GCC TCC CGC ACC TCG TCC AAA 1680 
Pro Ala Met Lys Asn Ala His Ala Lys Ala Ser Arg Thr Ser Ser Lys 
545 550 555 560 

CAC AAG GAG GAT GTG TAT GAG AAC CTG CAC ACT AAG AAC AAG AGG GAG 1728 



His Lys Glu Asp Val Tyr Glu Asn Leu His Thr Lys Asn Lys Arg Glu 
565 570 575 

GAG AAA GTG AAG AAG CAG CGG TCA GCA GAC AAG GAG AAG AGC AAG GGT 177 6 
Glu Lys Val Lys Lys Gin Arg Ser Ala Asp Lys Glu Lys Ser Lys Gly 
580 585 590 

TCC CTC AAG AGG AAG CGA ATT CTG CAG TCG ACG GTA CCG CGG GCC CGG 1824 
Ser Leu Lys Arg Lys Arg He Leu Gin Ser Thr Val Pro Arg Ala Arg 
595 600 605 

GAT CCA CCG GTC GCC ACC ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC 187 2 
Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr 
610 615 620 

GGG GTG GTG CCC ATC CTG GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC 192 0 
Gly Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His 
625 630 635 640 

AAG TTC AGC GTG TCC GGC GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG 1968 
Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys 
645 650 655 

CTG ACC CTG AAG TTC ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG 2 016 
Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp 
660 665 670 

CCC ACC CTC GTG ACC ACC CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC 2 064 
Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg 
675 680 685 

TAC CCC GAC CAC ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC 2112 
Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro 
690 695 700 

GAA GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC 2160 
Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn 
705 710 715 720 

TAC AAG ACC CGC GCC GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC 22 08 

Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn 
725 730 735 

CGC ATC GAG CTG AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG 2256 
Arg He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu 
740 745 750 

GCG CAC AAG CTG GAG TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG 2 3 04 

Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met 
755 760 765 

GCC GAC AAG CAG AAG AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC 2 3 52 

Ala Asp Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys He Arg His 
770 775 780 

AAC ATC GAG GAC GGC AGC GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC 2 4 00 

Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn 
785 790 795 800 



ACC CCC ATC GGC GAC GGC CCC GTC CTG CTG CCC GAC AAC CAC TAC CTG 2 44S 
Thr Pro lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu 
805 810 815 

AGO ACC CAG TCC GCC CTG AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC 2 496 
Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His 
820 825 830 

ATG GTC CTG CTG GAG TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG 2 54 4 
Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr Leu Gly Met 
835 840 845 

GAC GAG CTG TAC AAG TAA 2 562 

Asp Glu Leu Tyr Lys 

850 



(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 853 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

Met Leu Ser Arg Gly Trp Phe Kis Arg Asp Leu Ser Gly Leu Asp Ala 

15 10 15 

Glu Thr Leu Leu Lys Gly Arg Gly Val His Gly Ser Phe Leu Ala Arg 

20 25 30 

Pro Ser Arg Lys Asn Gin Gly Asp Phe Ser Leu Ser Val Arg Val Gly 

35 4C 45 

Asp Gin Val Thr His lie Arg He Gin Asn Ser Gly Asp Phe Tyr Asp 

50 55 60 

Leu Tyr Gly Gly Glu Lys Phe Ala Thr Leu Thr Glu Leu Val Glu Tyr 
65 70 75 80 

Tyr Thr Gin Gin Gin Gly Val Leu Gin Asp Arg Asp Gly Thr He He 

85 90 95 

His Leu Lys Tyr Pro Leu Asn Cys Ser Asp Pro Thr Ser Glu Arg Trp 

100 105 HO 

Tyr His Gly His Met Ser Gly Gly Gin Ala Glu Thr Leu Leu Gin Ala 

115 120 125 

Lys Gly Glu Pro Trp Thr Phe Leu Val Arg Glu Ser Leu Ser Gin Pro 

130 135 140 

Gly Asp Phe Val Leu Ser Val Leu Ser Asp Gin Pro Lys Ala Gly Pro 
145 150 155 160 

Gly Ser Pro Leu Arg Val Thr H: s He Lys Val Met Cys Glu Gly Gly 

165 170 175 

Arg Tyr Thr Val Gly Gly Leu Glu Thr Phe Asp Ser Leu Thr Asp Leu 

180 185 190 

Val Glu His Phe Lys Lys Thr Gly He Glu Glu Ala Ser Gly Ala Phe 

195 200 205 

Val Tyr Leu Arg Gin Pro Tyr Tyr Ala Thr Arg Val Asn Ala Ala Asp 
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210 

lie Glu Asn Arg 
225 

Thr Ala Lys Ala 

Glu Val Lys Asn 
260 

Lys Gly Lys Asn 
275 

Val He Leu Gin 
290 

Asn Ala Asn Tyr 
305 

Lys Thr Tyr He 

Phe Trp Gin Met 
340 

Thr Arg Glu Val 
355 

Glu Val Gly Met 
370 

Gly Glu His Asp 
385 

Pro Leu Asp Asn 

Leu Ser Trp Pro 
420 

Ser Phe Leu Asp 
435 

Gly Pro He He 
450 

lie lie Val lie 
465 

Asp Cys Asp He 

Arg Ser Gly Met 
500 

Ala He Ala Gin 
515 

Gin Ser Gin Lys 
530 

Pro Ala Met Lys 
545 

His Lys Glu Asp 

Glu Lys Val Lys 
580 

Ser Leu Lys Arg 
595 

Asp Pro Pro Val 
610 

Gly Val Val Pro 
625 

Lys Phe Ser Val 

Leu Thr Leu Lys 
660 

Pro Thr Leu Val 



215 

Val Leu Glu Leu 
230 

Gly Phe Trp Glu 
245 

Leu His Gin Arg 

Arg Tyr Lys Asn 
280 

Gly Arg Asp Ser 
295 

lie Lys Asn Gin 
310 

Ala Ser Gin Gly 
325 

Ala Trp Gin Glu 

Glu Lys Gly Arg 
360 

Gin Arg Ala Tyr 
375 

Thr Thr Glu Tyr 
390 

Gly Asp Leu lie 
405 

Asp His Gly Val 

Gin lie Asn Gin 
440 

Val His Cys Ser 
455 

Asp Met Leu Met 
470 

Asp He Gin Lys 
485 

Val Gin Thr Glu 

Phe lie Glu Thr 
52 0 

Gly Gin Glu Ser 
535 

Asn Ala His Ala 
550 

Val Tyr Glu Asn 
565 

Lys Gin Arg Ser 

Lys Arg lie Leu 
600 

Ala Thr Met Val 
615 

lie Leu Val Glu 
630 

Ser Gly Glu Gly 
645 

Phe He Cys Thr 
Thr Thr Leu Thr 



220 

Asn Lys Lys Gin 
235 

Glu Phe Glu Ser 
250 

Leu Glu Gly Gin 
265 

lie Leu Pro Phe 

Asn He Pro Gly 
300 

Leu Leu Gly Pro 
315 

Cys Leu Glu Ala 
330 

Asn Ser Arg Val 
345 

Asn Lys Cys Val 

Gly Pro Tyr Ser 
380 

Lys Leu Arg Thr 
395 

Arg Glu lie Trp 
410 

Pro Ser Glu Pro 
425 

Arg Gin Glu Ser 

Ala Gly lie Gly 
460 

Glu Asn lie Ser 
475 

Thr lie Gin Met 
490 

Ala Gin Tyr Lys 
505 

Thr Lys Lys Lys 

Glu Tyr Gly Asn 
540 

Lys Ala Ser Arg 
555 

Leu His Thr Lys 
570 

Ala Asp Lys Glu 
585 

Gin Ser Thr Val 

Ser Lys Gly Glu 
620 

Leu Asp Gly Asp 
635 

Glu Gly Asp Ala 

650 

Thr Gly Lys Leu 
665 

Tyr Gly Val Gin 



Glu Ser Glu Asp 
240 

Leu Gin Lys Gin 
255 

Arg Pro Glu Asn 
270 

Asp His Ser Arg 

285 

Ser Asp Tyr lie 

Asp Glu Asn Ala 
320 

Thr Val Asn Asp 
335 

lie Val Met Thr 
350 

Pro Tyr Trp Pro 
365 

Val Thr Asn Cys 

Leu Gin Val Ser 
400 

His Tyr Gin Tyr 
415 

Gly Gly Val Leu 
430 

Leu Pro His Ala 
445 

Arg Thr Gly Thr 

Thr Lys Gly Leu 
480 

Val Arg Ala Gin 
495 

Phe lie Tyr Val 
510 

Leu Glu Val Leu 
525 

lie Thr Tyr Pro 

Thr Ser Ser Lys 
560 

Asn Lys Arg Glu 
575 

Lys Ser Lys Gly 
590 

Pro Arg Ala Arg 

605 

Glu Leu Phe Thr 

Val Asn Gly His 
640 

Thr Tyr Gly Lys 
655 

Pro Val Pro Trp 
670 

Cys Phe Ser Arg 
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675 680 685 

Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro 

690 695 700 

Glu Gly Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn 
705 710 715 720 

Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn 

725 730 735 

Arg lie Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu 

740 745 750 

Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met 

755 760 765 

Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His 

770 775 780 

Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn 
785 790 795 800 

Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu 

805 810 815 

Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His 

820 825 830 

Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr Leu Gly Met 

835 840 845 

Asp Glu Leu Tyr Lys 
850 



(2) INFORMATION FOR SEQ ID NO: 12 0: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2994 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE : 



(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 1. . .2991 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 



ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 48 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

15 10 15 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 240 



Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 2 88 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 336 
Arg Thr He Fhe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 HO 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 480 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 57 6 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC G2C GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 7 20 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT CGA GCT CAA GCT TCG AAT TCG ACC ATG GAG CGG CCC 7 68 

Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Thr Met Glu Arg Pro 
245 250 255 

CCG GGG CTG CGG CCG GGC GCG GGC GGG CCC TGG GAG ATG CGG GAG CGG 816 
Pro Gly Leu Arg Pro Gly Ala Gly Gly Pro Trp Glu Met Arg Glu Arg 
260 265 270 



CTG GGC ACC GGC GGC TTC GGG AAC GTC TGT CTG TAC CAG CAT CGG GAA 
Leu Gly Thr 3ly Gly Phe Gly Asn Val Cys Leu Tyr Gin His Arg Glu 
275 280 285 

CTT GAT CT2 AAA ATA GCA ATT AAG TCT TGT CGC CTA GAG CTA AGT ACC 
Leu Asp Leu Lys He Ala He Lys Ser Cys Arg Leu Glu Leu Ser Thr 
290 295 300 



864 



/66 



AAA AAC AGA GAA CGA TGG TGC CAT GAA ATC CAG ATT ATG AAG AAG TTG 960 
Lys Asn Arg Glu Arg Trp Cys His Glu lie Gin lie Met Lys Lys Leu 
305 310 315 320 

AAC CAT GCC AAT GTT GTA AAG GCC TGT GAT GTT CCT GAA GAA TTG AAT 1008 
Asn His Ala Asn Val Val Lys Ala Cys Asp Val Pro Glu Glu Leu Asn 
325 330 335 

ATT TTG ATT CAT GAT GTG CCT CTT CTA GCA ATG GAA TAC TGT TCT GGA 1056 
lie Leu lie His Asp Val Pro Leu Leu Ala Met Glu Tyr Cys Ser Gly 
340 345 350 

GGA GAT CTC CGA AAG CTG CTC AAC AAA CCA GAA AAT TGT TGT GGA CTT 1104 
Gly Asp Leu Arg Lys Leu Leu Asn Lys Pro Glu Asn Cys Cys Gly Leu 
355 360 365 

AAA GAA AGC CAG ATA CTT TCT TTA CTA AGT GAT ATA GGG TCT GGG ATT 1152 
Lys Glu Ser Gin lie Leu Ser Leu Leu Ser Asp lie Gly Ser Gly lie 
370 375 380 

CGA TAT TTG CAT GAA AAC AAA ATT ATA CAT CGA GAT CTA AAA CCT GAA 12 00 
Arg Tyr Leu His Glu Asn Lys lie lie His Arg Asp Leu Lys Pro Glu 
385 390 395 400 

AAC ATA GTT CTT CAG GAT GTT GGT GGA AAG ATA ATA CAT AAA ATA ATT 1248 
Asn lie Val Leu Gin Asp Val Gly Gly Lys He He His Lys He He 
405 410 415 

GAT CTG GGA TAT GCC AAA GAT GTT GAT CAA GGA AGT CTG TGT ACA TCT 12 96 
Asp Leu Gly Tyr Ala Lys Asp Val Asp Gin Gly Ser Leu Cys Thr Ser 
420 425 430 

TTT GTG GGA ACA CTG CAG TAT CTG GCC CCA GAG CTC TTT GAG AAT AAG 13 44 
Phe Val Gly Thr Leu Gin Tyr Leu Ala Pro Glu Leu Phe Glu Asn Lys 
435 440 445 

CCT TAC ACA GCC ACT GTT GAT TAT TGG AGC TTT GGG ACC ATG GTA TTT 13 92 

Pro Tyr Thr Ala Thr Val Asp Tyr Trp Ser Phe Gly Thr Met Val Phe 
450 455 460 

GAA TGT ATT GCT GGA TAT AGG CCT TTT TTG CAT CAT CTG CAG CCA TTT 1440 
Glu Cys He Ala Gly Tyr Arg Pro Phe Leu His His Leu Gin Pro Phe 
465 470 475 480 

ACC TGG CAT GAG AAG ATT AAG AAG AAG GAT CCA AAG TGT ATA TTT GCA 14 88 
Thr Trp His Glu Lys He Lys Lys Lys Asp Pro Lys Cys He Phe Ala 
485 490 495 

TGT GAA GAG ATG TCA GGA GAA GTT CGG TTT AGT AGC CAT TTA CCT CAA 153 6 
Cys Glu Glu Met Ser Gly Glu Val Arg Phe Ser Ser His Leu Pro Gin 
500 505 510 

CCA AAT AGC CTT TGT AGT TTA ATA GTA GAA CCC ATG GAA AAC TGG CTA 1584 
Pro Asn Ser Leu Cys Ser Leu lie Val Glu Pro Met Glu Asn Trp Leu 
515 520 525 

CAG TTG ATG TTG AAT TGG GAC CCT CAG CAG AGA GGA GGA CCT GTT GAC 1632 



/6? 



Gin Leu Met Leu Asn Trp Asp Pro Gin Gin Arg Gly Gly Pro Val Asp 
530 535 540 

CTT ACT TTG AAG CAG CCA AGA TGT TTT GTA TTA ATG GAT CAC ATT TTG 16 80 
Leu Thr Leu Lys Gin Pro Arg Cys phe Val Leu Met Asp His He Leu 
545 550 555 560 

AAT TTG AAG ATA GTA CAC ATC CTA AAT ATG ACT TCT GCA AAG ATA ATT 1728 
Asn Leu Lys He Val His He Leu Asn Met Thr Ser Ala Lys He He 
565 570 575 

TCT TTT CTG TTA CCA CCT GAT GAA AGT CTT CAT TCA CTA CAG TCT CGT 1776 
Ser Phe Leu Leu Pro Pro Asp Glu Ser Leu His Ser Leu Gin Ser Arg 
580 585 590 

ATT GAG CGT GAA ACT GGA ATA AAT ACT GGT TCT CAA GAA CTT CTT TCA 1824 
He Glu Arg Glu Thr Gly lie Asn Thr Gly Ser Gin Glu Leu Leu Ser 
595 600 605 

GAG ACA GGA ATT TCT CTG GAT CCT CGG AAA CCA GCC TCT CAA TGT GTT 1872 
Glu Thr Gly He Ser Leu Asp Pro Arg Lys Pro Ala Ser Gin Cys Val 
610 615 620 

CTA GAT GGA GTT AGA GGC TGT GAT AGC TAT ATG GTT TAT TTG TTT GAT 1920 
Leu Asp Gly Val Arg Gly Cys Asp Ser Tyr Met Val Tyr Leu Phe Asp 
625 630 635 640 

AAA AGT AAA ACT GTA TAT GAA GGG CCA TTT GCT TCC AGA AGT TTA TCT 1968 
Lys Ser Lys Thr Val Tyr Glu Gly Pro Phe Ala Ser Arg Ser Leu Ser 
645 650 655 

GAT TGT GTA AAT TAT ATT GTA CAG GAC AGC AAA ATA CAG CTT CCA ATT 2016 
Asp Cys Val Asn Tyr He Val Gin Asp Ser Lys He Gin Leu Pro He 
660 665 670 

ATA CAG CTG CGT AAA GTG TGG GCT GAA GCA GTG CAC TAT GTG TCT GGA 2064 
He Gin Leu Arg Lys Val Trp Ala Glu Ala Val His Tyr Val Ser Gly 
675 680 685 

CTA AAA GAA GAC TAT AGC AGG CTC TTT CAG GGA CAA AGG GCA GCA ATG 2112 
Leu Lys Glu Asp Tyr Ser Arg Leu Phe Gin Gly Gin Arg Ala Ala Met 
690 695 700 

TTA AGT CTT CTT AGA TAT AAT GCT AAC TTA ACA AAA ATG AAG AAC ACT 2160 
Leu Ser Leu Leu Arg Tyr Asn Ala Asn Leu Thr Lys Met Lys Asn Thr 
705 710 715 720 

TTG ATC TCA GCA TCA CAA CAA CTG AAA GCT AAA TTG GAG TTT TTT CAC 2208 
Leu He Ser Ala Ser Gin Gin Leu Lys Ala Lys Leu Glu Phe Phe His 
725 730 735 

AAA AGC ATT CAG CTT GAC TTG GAG AGA TAC AGC GAG CAG ATG ACG TAT 22 56 

Lys Ser He Gin Leu Asp Leu Glu Arg Tyr Ser Glu Gin Met Thr Tyr 
740 745 750 

GGG ATA TCT TCA GAA AAA ATG CTA AAA GCA TGG AAA GAA ATG GAA GAA 2 3 04 

Gly lie Ser Ser Glu Lys Met Leu Lys Ala Trp Lys Glu Met Glu Glu 
755 760 765 



AAG GCC ATC CAC TAT GCT GAG GTT GGT GTC ATT GGA TAC CTG GAG GAT 2 3 52 
Lys Ala He His Tyr Ala Glu Val Gly Val He Gly Tyr Leu Glu Asp 
770 775 780 

CAG ATT ATG TCT TTG CAT GCT GAA ATC ATG GGG CTA CAG AAG AGC CCC 2400 
Gin He Met Ser Leu His Ala Glu He Met Gly Leu Gin Lys Ser Pro 
785 790 795 800 

TAT GGA AGA CGT CAG GGA GAC TTG ATG GAA TCT CTG GAA CAG CGT GCC 2 448 
Tyr Gly Arg Arg Gin Gly Asp Leu Met Glu Ser Leu Glu Gin Arg Ala 
805 810 815 

ATT GAT CTA TAT AAG CAG TTA AAA CAC AGA CCT TCA GAT CAC TCC TAC 2496 
lie Asp Leu Tyr Lys Gin Leu Lys His Arg Pro Ser Asp His Ser Tyr 
820 825 830 

AGT GAC AGC ACA GAG ATG GTG AAA ATC ATT GTG CAC ACT GTG CAG AGT 2544 
Ser Asp Ser Thr Glu Met Val Lys He He Val His Thr Val Gin Ser 
835 840 845 

CAG GAC CGT GTG CTC AAG GAG CTG TTT GGT CAT TTG AGC AAG TTG TTG 2592 
Gin Asp Arg Val Leu Lys Glu Leu Phe Gly His Leu Ser Lys Leu Leu 
850 855 860 

GGC TGT AAG CAG AAG ATT ATT GAT CTA CTC CCT AAG GTG GAA GTG GCC 2640 
Gly Cys Lys Gin Lys He He Asp Leu Leu Pro Lys Val Glu Val Ala 
865 870 875 880 

CTC AGT AAT ATC AAA GAA GCT GAC AAT ACT GTC ATG TTC ATG CAG GGA 2 688 
Leu Ser Asn He Lys Glu Ala Asp Asn Thr Val Met Phe Met Gin Gly 
885 890 895 

AAA AGG CAG AAA GAA ATA TGG CAT CTC CTT AAA ATT GCC TGT ACA CAG 2736 
Lys Arg Gin Lys Glu He Trp His Leu Leu Lys He Ala Cys Thr Gin 
900 905 910 

AGT TCT GCC CGC TCT CTT GTA GGA TCC AGT CTA GAA GGT GCA GTA ACC 27 84 
Ser Ser Ala Arg Ser Leu Val Gly Ser Ser Leu Glu Gly Ala Val Thr 
915 920 925 

CCT CAG ACA TCA GCA TGG CTG CCC CCG ACT TCA GCA GAA CAT GAT CAT 2832 
Pro Gin Thr Ser Ala Trp Leu Pro Pro Thr Ser Ala Glu His Asp His 
930 935 940 

TCT CTG TCA TGT GTG GTA ACT CCT CAA GAT GGG GAG ACT TCA GCA CAA 2880 
Ser Leu Ser Cys Val Val Thr Pro Gin Asp Gly Glu Thr Ser Ala Gin 
945 950 955 960 

ATG ATA GAA GAA AAT TTG AAC TGC CTT GGC CAT TTA AGC ACT ATT ATT 2928 
Met He Glu Glu Asn Leu Asn Cys Leu Gly His Leu Ser Thr He He 
965 970 975 

CAT GAG GCA AAT GAG GAA CAG GGC AAT AGT ATG ATG AAT CTT GAT TGG 297 6 
His Glu Ala Asn Glu Glu Gin Gly Asn Ser Met Met Asn Leu Asp Trp 
980 985 990 

AGT TGG TTA ACA GAA TGA 2 994 



Ser Trp Leu Thr Glu 

995 



(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 997 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 



Met 


Val 


Ser 


Lys 


Gly 


Glu 


Glu 


Leu 


Phe 


Thr Gly Val 


Val 


Pro 


He 


Leu 


1 








5 










10 










15 




Val 


Glu 


Leu 


Asp 


Gly 


Asp 


Val 


Asn 


Gly 


His 


Lys 


Phe 


Ser 


Val 


Ser Gly 








20 










25 










30 






Glu 


Gly 


Glu 


Gly 


Asp 


Ala 


Thr 


Tyr 


Gly 


Lys 


Leu 


Thr 


Leu 


Lys 


Phe 


He 






35 










40 










4 b 








Cys 


Thr 


Thr 


Gly 


Lys 


Leu 


Pro 


Val 


Pro 


Trp 


Pro 


Thr 


Leu 


Val 


Thr 


Thr 




50 










55 










60 










Leu 


Thr 


Tyr 


Gly 


Val 


Gin 


Cys 


Phe 


Ser 


Arg 


Tyr 


Pro 


Asp 


His 


Met 


Lys 


65 










70 










75 










80 


Gin 


His 


Asp 


Phe 


Phe 


Lys 


Ser 


Ala 


Met 


Pro 


Glu 


Gly 


Tyr 


Val 


Gin 


Glu 










85 










90 










95 




Arg 


Thr 


He 


Phe 


Phe 


Lys 


Asp 


Asp 


Gly Asn 


Tyr 


Lys 


Thr 


Arg 


Ala 


Glu 








100 










105 










110 






Val 


Lys 


Phe 


Glu 


Gly 


Asp 


Thr 


Leu 


Val 


Asn 


Arg 


He 




Leu 


Lys 


Gly 






115 










120 


















lie 


Asp 


Phe 


Lys 


Glu 


Asp 


Gly 


Asn 


He 


Leu Gly His 


Lys 


Leu 


Glu 


Tyr 




130 










TIC 

13b 










140 










Asn 


Tyr 


Asn 


Ser 


His 


Asn 


Val 


Tyr 


He 


Met 


Ala 


Asp 


Lys 


Gin 


Lys 


Asn 


145 










150 










155 










160 


Gly 


He 


Lys 


Val 


Asn 


Phe 


Lys 


He 


Arg 


His 


Asn 


He 


Glu 


Asp 


Gly 


Ser 










165 










170 










175 




Val 


Gin 


Leu 


Ala 


Asp 


His 


Tyr 


Gin 


Gin 


Asn 


Thr 


Pro 


He 


Gly Asp Gly 








180 










185 










190 






Pro 


Val 


Leu 


Leu 


Pro 


Asp 


Asn 


His 


Tyr 


Leu 


Ser 


Thr 


Gin 


Ser 


Ala 


Leu 






195 










200 










205 








Ser 


Lys 


Asp 


Pro 


Asn 


Glu 


Lys 


Arg 


Asp 


His 


Met 


Val 


Leu 


Leu 


Glu 


Phe 




210 










215 










220 










Val 


Thr 


Ala 


Ala 


Gly 


He 


Thr 


Leu 


Gly 


Met 


Asp 


Glu 


Leu 


Tyr 


Lys 


Ser 


225 










230 










235 










240 


Gly 


Leu 


Arg 


Ser 


Arg 


Ala 


Gin 


Ala 


Ser 


Asn 


Ser 


Thr 


Met 


Glu 


Arg 


Pro 










245 










250 










255 




Pro 


Gly 


Leu 


Arg 


Pro 


Gly 


Ala 


Gly 


Gly 


Pro 


Trp 


Glu 


Met 


Arg 


Glu 


Arg 








260 










265 










270 






Leu 


Gly 


Thr 


Gly 


Gly 


Phe 


Gly 


Asn 


Val 


Cys 


Leu 


Tyr 


Gin 


His 


Arg 


Glu 






275 










280 










285 








Leu 


Asp 


Leu 


Lys 


He 


Ala 


He 


Lys 


Ser 


Cys 


Arg 


Leu 


Glu 


Leu 


Ser 


Thr 




290 










295 










300 










Lys 


Asn 


Arg 


Glu 


Arg 


Trp 


Cys 


His 


Glu 


He 


Gin 


lie 


Met 


Lys 


Lys 


Leu 


305 










310 










315 










320 


As n 


His 


Ala 


Asn 


Val 


Val 


Lys 


Ala 


Cys 


Asp 


Val 


Pro 


Glu 


Glu 


Leu 


Asn 



325 330 335 

lie Leu lie His Asp Val Pro Leu Leu Ala Met Glu T^x Cys Ser Gly 

340 345 350 

Gly Asp Leu Arg Lys Leu Leu Asn Lys Pro Glu Asn Cys Cys Gly Leu 

355 360 365 

Lys Glu Ser Gin lie Leu Ser Leu Leu Ser Asp lie Gly Ser Gly lie 

370 375 380 

Arg Tyr Leu His Glu Asn Lys lie lie His Arg Asp Leu Lys Pro Glu 
385 390 395 400 

Asn He Val Leu Gin Asp Val Gly Gly Lys He He His Lys He He 

405 410 415 

Asp Leu Gly Tyr Ala Lys Asp Val Asp Gin Gly Ser Leu Cys Thr Ser 

420 425 430 

Phe Val Gly Thr Leu Gin Tyr Leu Ala Pro Glu Leu Phe Glu Asn Lys 

435 440 445 

Pro Tyr Thr Ala Thr Val Asp Tyr Trp Ser Phe Gly Thr Met Val Phe 

450 455 460 

Glu Cys He Ala Gly Tyr Arg Pro Phe Leu His His Leu Gin Pro Phe 
465 470 475 480 

Thr Trp His Glu Lys He Lys Lys Lys Asp Pro Lys Cys He Phe Ala 

485 490 495 

Cys Glu Glu Met Ser Gly Glu Val Arg Phe Ser Ser His Leu Pro Gin 

500 505 510 

Pro Asn Ser Leu Cys Ser Leu He Val Glu Pro Met Glu Asn Trp Leu 

515 520 525 

Gin Leu Met Leu Asn Trp Asp Pro Gin Gin Arg Gly Gly Pro Val Asp 

530 535 540 

Leu Thr Leu Lys Gin Pro Arg Cys Phe Val Leu Met Asp His He Leu 
545 550 555 560 

Asn Leu Lys lie Val His He Leu Asn Met Thr Ser Ala Lys He He 

565 570 575 

Ser Phe Leu Leu Pro Pro Asp Glu Ser Leu His Ser Leu Gin Ser Arg 

580 585 590 

He Glu Arg Glu Thr Gly lie Asn Thr Gly Ser Gin Glu Leu Leu Ser 

595 600 605 

Glu Thr Gly lie Ser Leu Asp Pro Arg Lys Pro Ala Ser Gin Cys Val 

610 615 620 

Leu Asp Gly Val Arg Gly Cys Asp Ser Tyr Met Val Tyr Leu Phe Asp 
625 630 635 640 

Lys Ser Lys Thr Val Tyr Glu Gly Pro Phe Ala Ser Arg Ser Leu Ser 

645 650 655 

Asp Cys Val Asn Tyr He Val Gin Asp Ser Lys lie Gin Leu Pro He 

660 665 670 

He Gin Leu Arg Lys Val Trp Ala Glu Ala Val His Tyr Val Ser Gly 

675 680 685 

Leu Lys Glu Asp Tyr Ser Arg Leu Phe Gin Gly Gin Arg Ala Ala Met 

690 695 700 

Leu Ser Leu Leu Arg Tyr Asn Ala Asn Leu Thr Lys Met Lys Asn Thr 
705 710 715 720 

Leu He Ser Ala Ser Gin Gin Leu Lys Ala Lys Leu Glu Phe Phe His 

725 730 735 

Lys Ser lie Gin Leu Asp Leu Glu Arg Tyr Ser Glu Gin Met Thr Tyr 

740 745 750 

Gly lie Ser Ser Glu Lys Met Leu Lys Ala Trp Lys Glu Met Glu Glu 

755 760 765 

Lys Ala lie His Tyr Ala Glu Val Gly Val lie Gly Tyr Leu Glu Asp 

770 775 780 

Gin lie Met Ser Leu His Ala Glu lie Met Gly Leu Gin Lys Ser Pro 



785 790 795 800 

Tyr Gly Arg Arg Gin Gly Asp Leu Met Glu Ser Leu Glu Gin Arg Ala 

805 810 815 

lie Asp Leu Tyr Lys Gin Leu Lys His Arg Pro Ser Asp His Ser Tyr 

820 825 830 

Ser Asp Ser Thr Glu Met Val Lys lie lie Val His Thr Val Gin Ser 

835 840 845 

Gin Asp Arg Val Leu Lys Glu Leu Phe Gly His Leu Ser Lys Leu Leu 

850 855 860 

Gly Cys Lys Gin Lys lie lie Asp Leu Leu Pro Lys Val Glu Val Ala 
865 870 875 880 

Leu Ser Asn lie Lys Glu Ala Asp Asn Thr Val Met Phe Met Gin Gly 

885 890 895 

Lys Arg Gin Lys Glu lie Trp His Leu Leu Lys lie Ala Cys Thr Gin 

900 905 910 

Ser Ser Ala Arg Ser Leu Val Gly Ser Ser Leu Glu Gly Ala Val Thr 

915 920 925 

Pro Gin Thr Ser Ala Trp Leu Pro Pro Thr Ser Ala Glu His Asp His 

930 935 940 

Ser Leu Ser Cys Val Val Thr Pro Gin Asp Gly Glu Thr Ser Ala Gin 
945 950 955 960 

Met lie Glu Glu Asn Leu Asn Cys Leu Gly His Leu Ser Thr lie lie 

965 970 975 

His Glu Ala Asn Glu Glu Gin Gly Asn Ser Met Met Asn Leu Asp Trp 

980 985 990 

Ser Trp Leu Thr Glu 
995 



{2) INFORMATION FOR SEQ ID NO: 122: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 2991 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 



(A) NAME /KEY: Coding Sequence 
<B) LOCATION: 1. . .2988 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 



ATG GAG CGG CCC CCG GGG CTG CGG CCG GGC GCG GGC GGG CCC TGG GAG 4 8 

Met Glu Arg Pro Pro Gly Leu Arg Pro Gly Ala Gly Gly Pro Trp Glu 
15 10 15 

ATG CGG GAG CGG CTG GGC ACC GGC GGC TTC G5G AAC GTC TGT CTG TAC 96 

Met Arg Glu Arg Leu Gly Thr Gly Gly Phe Gly Asn Val Cys Leu Tyr 
20 25 30 

CAG CAT CGG GAA CTT GAT CTC AAA ATA GCA ATT AAG TCT TGT CGC CTA 144 

Gin His Arg Glu Leu Asp Leu Lys lie Ala lie Lys Ser Cys Arg Leu 
35 40 45 

GAG CTA AGT ACC AAA AAC AGA GAA CGA TGG TGC CAT GAA ATC CAG ATT 192 



Glu Leu Ser Thr Lys Asn Arg Glu Arg Trp Cys His Glu lie Gin lie 

50 55 60 

ATG AAG AAG TTG AAC CAT GCC AAT GTT GTA AAG GCC TGT GAT GTT CCT 2 40 

Met Lys Lys Leu Asn His Ala Asn Val Val Lys Ala Cys Asp Val Pro 
65 70 75 80 

GAA GAA TTG AAT ATT TTG ATT CAT GAT GTG CCT CTT CTA GCA ATG GAA 2 88 

Glu Glu Leu Asn lie Leu lie His Asp Val Pro Leu Leu Ala Met Glu 
85 90 95 

TAC TGT TCT GGA GGA GAT CTC CGA AAG CTG CTC AAC AAA CCA GAA AAT 336 
Tyr Cys Ser Gly Gly Asp Leu Arg Lys Leu Leu Asn Lys Pro Glu Asn 
100 105 110 

TGT TGT GGA CTT AAA GAA AGC CAG ATA CTT TCT TTA CTA AGT GAT ATA 3 84 

Cys Cys Gly Leu Lys Glu Ser Gin lie Leu Ser Leu Leu Ser Asp lie 
115 120 125 

GGG TCT GGG ATT CGA TAT TTG CAT GAA AAC AAA ATT ATA CAT CGA GAT 432 
Gly Ser Gly He Arg Tyr Leu His Glu Asn Lys He He His Arg Asp 
130 135 140 

CTA AAA CCT GAA AAC ATA GTT CTT CAG GAT GTT GGT GGA AAG ATA ATA 4 80 

Leu Lys Pro Glu Asn He Val Leu Gin Asp Val Gly Gly Lys He lie 
145 150 155 160 

CAT AAA ATA ATT GAT CTG GGA TAT GCC AAA GAT GTT GAT CAA GGA AGT 528 
His Lys He He Asp Leu Gly Tyr Ala Lys Asp Val Asp Gin Gly Ser 
165 170 175 

CTG TGT ACA TCT TTT GTG GGA ACA CTG CAG TAT CTG GCC CCA GAG CTC 57 6 

Leu Cys Thr Ser Phe Val Gly Thr Leu Gin Tyr Leu Ala Pro Glu Leu 
180 185 190 

TTT GAG AAT AAG CCT TAC ACA GCC ACT GTT GAT TAT TGG AGC TTT GGG 62 4 

Phe Glu Asn Lys Pro Tyr Thr Ala Thr Val Asp Tyr Trp Ser Phe Gly 
195 200 205 

ACC ATG GTA TTT GAA TGT ATT GCT GGA TAT AGG CCT TTT TTG CAT CAT 672 
Thr Met Val Phe Glu Cys He Ala Gly Tyr Arg Pro Phe Leu His His 
210 215 220 

CTG CAG CCA TTT ACC TGG CAT GAG AAG ATT AAG AAG AAG GAT CCA AAG 72 0 

Leu Gin Pro Phe Thr Trp His Glu Lys He Lys Lys Lys Asp Pro Lys 

225 230 235 240 

TGT ATA TTT GCA TGT GAA GAG ATG TCA GGA GAA GTT CGG TTT AGT AGC 7 68 

Cys He Phe Ala Cys Glu Glu Met Ser Gly Glu Val Arg Phe Ser Ser 
245 250 255 

CAT TTA CCT CAA CCA AAT AGC CTT TGT AGT TTA ATA GTA GAA CCC ATG 816 
His Leu Pro Gin Pro Asn Ser Leu Cys Ser Leu He Val Glu Pro Met 
260 265 270 

GAA AAC TGG CTA CAG TTG ATG TTG AAT TGG GAC CCT CAG CAG AGA GGA 564 
Glu Asn Trp Leu Gin Leu Met Leu Asn Trp Asp Pro Gin Gin Arg Gly 
275 280 285 
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GGA CCT GTT GAC CTT ACT TTG AAG CAG CCA AGA TGT TTT GTA TTA ATG 912 
Gly Pro Val Asp Leu Thr Leu Lys Gin Pro Arg Cys Phe Val Leu Met 
290 295 300 

GAT CAC ATT TTG AAT TTG AAG ATA GTA CAC ATC CTA AAT ATG ACT TCT 960 
Asp His He Leu Asn Leu Lys He Val His lie Leu Asn Met Thr Ser 
305 310 315 320 

GCA AAG ATA ATT TCT TTT CTG TTA CCA CCT GAT GAA AGT CTT CAT TCA 10 08 
Ala Lys He He Ser Phe Leu Leu Pro Pro Asp Glu Ser Leu His Ser 
325 330 335 

CTA CAG TCT CGT ATT GAG CGT GAA ACT GGA ATA AAT ACT GGT TCT CAA 1056 
Leu Gin Ser Arg He Glu Arg Glu Thr Gly He Asn Thr Gly Ser Gin 
340 345 350 

GAA CTT CTT TCA GAG ACA GGA ATT TCT CTG GAT CCT CGG AAA CCA GCC 1104 
Glu Leu Leu Ser Glu Thr Gly He Ser Leu Asp Pro Arg Lys Pro Ala 
355 360 365 

TCT CAA TGT GTT CTA GAT GGA GTT AGA GGC TGT GAT AGC TAT ATG GTT 1152 
Ser Gin Cys Val Leu Asp Gly Val Arg Gly Cys Asp Ser Tyr Met Val 
370 375 380 

TAT TTG TTT GAT AAA AGT AAA ACT GTA TAT GAA GGG CCA TTT GCT TCC 1200 
Tyr Leu Phe Asp Lys Ser Lys Thr Val Tyr Glu Gly Pro Phe Ala Ser 
385 390 395 400 

AGA AGT TTA TCT GAT TGT GTA AAT TAT ATT GTA CAG GAC AGC AAA ATA 1248 
Arg Ser Leu Ser Asp Cys Val Asn Tyr He Val Gin Asp Ser Lys He 
405 410 415 

CAG CTT CCA ATT ATA CAG CTG CGT AAA GTG TGG GCT GAA GCA GTG CAC 12 96 
Gin Leu Pro He He Gin Leu Arg Lys Val Trp Ala Glu Ala Val His 
420 425 430 

TAT GTG TCT GGA CTA AAA GAA GAC TAT AGC AGG CTC TTT CAG GGA CAA. 1344 
Tyr Val Ser Gly Leu Lys Glu Asp Tyr Ser Arg Leu Phe Gin Gly Gin 
435 440 445 

AGG GCA GCA ATG TTA AGT CTT CTT AGA TAT AAT GCT AAC TTA ACA AAA 13 92 
Arg Ala Ala Met Leu Ser Leu Leu Arg Tyr Asn Ala Asn Leu Thr Lys 
450 455 460 

ATG AAG AAC ACT TTG ATC TCA GCA TCA CAA CAA CTG AAA GCT AAA TTG 1440 
Met Lys Asn Thr Leu He Ser Ala Ser Gin Gin Leu Lys Ala Lys Leu 
465 470 475 480 

GAG TTT TTT CAC AAA AGC ATT CAG CTT GAC TTG GAG AGA TAC AGC GAG 14 88 
Glu Phe Phe His Lys Ser He Gin Leu Asp Leu Glu Arg Tyr Ser Glu 
485 490 49^ 

CAG ATG ACG TAT GGG ATA TCT TCA GAA AAA ATG CTA AAA GCA TGG AAA 1536 
Gin Met Thr Tyr Gly He Ser Ser Giu Lys Met Leu Lys Ala Trp Lys 
500 505 510 

GAA ATG GAA GAA AAG GCC ATC CAC TAT GCT GAG GTT GGT GTC ATT GGA 1584 



Glu Met Glu Glu Lys Ala lie His Tyr Ala Glu Val Gly Val lie Gly 
515 520 525 

TAC CTG GAG GAT CAG ATT ATG TCT TTG CAT GCT GAA ATC ATG GGG CTA 1632 
Tyr Leu Glu Asp Gin He Met Ser Leu His Ala Glu He Met Gly Leu 
530 535 540 

CAG AAG AGC CCC TAT GGA AGA CGT CAG GGA GAC TTG ATG GAA TCT CTG 1680 
Gin Lys Ser Pro Tyr Gly Arg Arg Gin Gly Asp Leu Met Glu Ser Leu 
545 550 555 560 

GAA CAG CGT GCC ATT GAT CTA TAT AAG CAG TTA AAA CAC AGA CCT TCA 1728 
Glu Gin Arg Ala He Asp Leu Tyr Lys Gin Leu Lys His Arg Pro Ser 
565 570 575 

GAT CAC TCC TAC AGT GAC AGC ACA GAG ATG GTG AAA ATC ATT GTG CAC 177 6 

Asp His Ser Tyr Ser Asp Ser Thr Glu Met Val Lys He He Val His 
580 585 590 

ACT GTG CAG AGT CAG GAC CGT GTG CTC AAG GAG CTG TTT GGT CAT TTG 1824 
Thr Val Gin Ser Gin Asp Arg Val Leu Lys Glu Leu Phe Gly His Leu 
595 600 605 

AGC AAG TTG TTG GGC TGT AAG CAG AAG ATT ATT GAT CTA CTC CCT AAG 1872 
Ser Lys Leu Leu Gly Cys Lys Gin Lys He He Asp Leu Leu Pro Lys 
610 615 620 

GTG GAA GTG GCC CTC AGT AAT ATC AAA GAA GCT GAC AAT ACT GTC ATG 1920 
Val Glu Val Ala Leu Ser Asn He Lys Glu Ala Asp Asn Thr Val Met 
625 630 635 640 

TTC ATG CAG GGA AAA AGG CAG AAA GAA ATA TGG CAT CTC CTT AAA ATT 1968 
Phe Met Gin Gly Lys Arg Gin Lys Glu He Trp His Leu Leu Lys lie 
645 650 655 

GCC TGT ACA CAG AGT TCT GCC CGC TCT CTT GTA GGA TCC AGT CTA GAA 2016 
Ala Cys Thr Gin Ser Ser Ala Arg Ser Leu Val Gly Ser Ser Leu Glu 
660 665 670 

GGT GGA GTA ACC CCT CAG ACA TCA GCA TGG CTG CCC CCG ACT TCA GCA 2064 
Gly Ala Val Thr Pro Gin Thr Ser Ala Trp Leu Pro Pro Thr Ser Ala 
675 680 685 

GAA CAT GAT CAT TCT CTG TCA TGT GTG GTA ACT CCT CAA GAT GGG GAG 2112 
Glu His Asp His Ser Leu Ser Cys Val Val Thr Pro Gin Asp Gly Glu 
690 695 700 

ACT TCA GCA CAA ATG ATA GAA GAA AAT TTG AAC TGC CTT GGC CAT TTA 2160 
Thr Ser Ala Gin Met lie Glu Glu Asn Leu Asn Cys Leu Gly His Leu 
705 710 715 720 

AGC ACT ATT ATT CAT GAG GCA AAT GAG GAA CAG GGC AAT AGT ATG ATG 22 08 
Ser Thr He He His Glu Ala Asn Glu Glu Gin Gly Asn Ser Met Met 
725 730 735 

AAT CTT GAT TGG AGT TGG TTA ACA GAA TGG GTA CCG CGG GCC CGG GAT 22 56 
Asn Leu Asp Trp Ser Trp Leu Thr Glu Trp Val Pro Arg Ala Arg Asp 
740 745 750 



CCA CCG GTC GCC ACC ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG 2 3 04 
Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly 
755 760 765 

GTG GTG CCC ATC CTG GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG 23 52 
Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys 
770 775 780 

TTC AGC GTG TCC GGC GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG 2400 
Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu 
785 790 795 800 

ACC CTG AAG TTC ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC 244 8 

Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro 
805 810 815 

ACC CTC GTG ACC ACC CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC 2496 
Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr 
820 825 830 

CCC GAC CAC ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA 2 544 
Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu 
835 840 845 

GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC 2 592 
Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr 
850 855 860 

AAG ACC CGC GCC GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC 2640 
Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg 
665 870 875 880 

ATC GAG CTG AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG 2688 
He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly 
885 890 895 

CAC AAG CTG GAG TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC 2736 
His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala 
900 905 910 

GAC AAG CAG AAG AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC 2784 
Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn 
915 920 925 

ATC GAG GAC GGC AGC GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC 2 832 
He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr 
930 935 940 

CCC ATC GGC GAC GGC CCC GTC CTG CTG CCC GAC AAC CAC TAC CTG AGC 2 880 

Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser 
945 950 955 960 

ACC CAG TCC GCC CTG AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG 292 8 
Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 
965 970 975 

GTC CTG CTG GAG TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC 2 97 6 
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Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp 
980 985 990 

GAG CTG TAC AAG TAA 2991 
Glu Leu Tyr Lys 
995 



(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 996 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

Met Glu Arg Pro Pro Gly Leu Arg Pro Gly Ala Gly Gly Pro Trp Glu 

15 10 15 

Met Arg Glu Arg Leu Gly Thr Gly Gly Phe Gly Asn Val Cys Leu Tyr 

20 25 30 

Gin His Arg Glu Leu Asp Leu Lys He Ala He Lys Ser Cys Arg Leu 

35 40 45 

Glu Leu Ser Thr Lys Asn Arg Glu Arg Trp Cys His Glu He Gin He 

50 55 60 

Met Lys Lys Leu Asn His Ala Asn Val Val Lys Ala Cys Asp Val Pro 
65 70 75 80 

Glu Glu Leu Asn He Leu He His Asp Val Pro Leu Leu Ala Met Glu 

85 90 95 

Tyr Cys Ser Gly Gly Asp Leu Arg Lys Leu Leu Asn Lys Pro Glu Asn 

100 105 110 

Cys Cys Gly Leu Lys Glu Ser Gin He Leu Ser Leu Leu Ser Asp He 

115 120 125 

Gly Ser Gly He Arg Tyr Leu His Glu Asn Lys He He His Arg Asp 

130 135 140 

Leu Lys Pro Glu Asn He Val Leu Gin Asp Val Gly Gly Lys He He 
145 150 155 160 

His Lys He He Asp Leu Gly Tyr Ala Lys Asp Val Asp Gin Gly Ser 

165 170 175 

Leu Cys Thr Ser Phe Val Gly Thr Leu Gin Tyr Leu Ala Pro Glu Leu 

180 185 190 

Phe Glu Asn Lys Pro Tyr Thr Ala Thr Val Asp Tyr Trp Ser Phe Gly 

195 200 205 

Thr Met Val Phe Glu Cys lie Ala Gly Tyr Arg Pro Phe Leu His His 

210 215 220 

Leu Gin Pro Phe Thr Trp His Glu Lys He Lys Lys Lys Asp Pro Lys 
225 230 235 240 

Cys lie Phe Ala Cys Glu Glu Met Ser Gly Glu Val Arg Phe Ser Ser 

245 250 255 

His Leu Pro Gin Pro Asn Ser Leu Cys Ser Leu lie Val Glu Pro Met 

260 265 270 

Glu Asn Trp Leu Gin Leu Met Leu Asn Trp Asp Pro Gin Gin Arg Gly 

275 280 285 

Gly Pro Val Asp Leu Thr Leu Lys Gin Pro Arg Cys Phe Val Leu Met 
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290 295 300 

Asp His He Leu Asn Leu Lys He Val His He Leu Asn Met Thr Ser 
305 310 315 320 

Ala Lys He He Ser Phe Leu Leu Pro Pro Asp Glu Ser Leu His Ser 

325 330 335 

Leu Gin Ser Arg He Glu Arg Glu Thr Gly He Asn Thr Gly Ser Gin 

340 345 350 

Glu Leu Leu Ser Glu Thr Gly He Ser Leu Asp Pro Arg Lys Pro Ala 

355 360 365 

Ser Gin Cys Val Leu Asp Gly Val Arg Gly Cys Asp Ser Tyr Met Val 

370 375 380 

Tyr Leu Phe Asp Lys Ser Lys Thr Val Tyr Glu Gly Pro Phe Ala Ser 
385 390 395 400 

Arg Ser Leu Ser Asp Cys Val Asn Tyr He Val Gin Asp Ser Lys He 

405 410 415 

Gin Leu Pro He He Gin Leu Arg Lys Val Trp Ala Glu Ala Val His 

420 425 430 

Tyr Val Ser Gly Leu Lys Glu Asp Tyr Ser Arg Leu Phe Gin Gly Gin 

435 440 445 

Arg Ala Ala Met Leu Ser Leu Leu Arg Tyr Asn Ala Asn Leu Thr Lys 

450 455 460 

Met Lys Asn Thr Leu He Ser Ala Ser Gin Gin Leu Lys Ala Lys Leu 
465 470 475 480 

Glu Phe Phe His Lys Ser He Gin Leu Asp Leu Glu Arg Tyr Ser Glu 

485 490 495 

Gin Met Thr Tyr Gly He Ser Ser Glu Lys Met Leu Lys Ala Trp Lys 

500 505 510 

Glu Met Glu Glu Lys Ala He His Tyr Ala Glu Val Gly Val He Gly 

515 520 525 

Tyr Leu Glu Asp Gin He Met Ser Leu His Ala Glu He Met Gly Leu 

530 535 540 

Gin Lys Ser Pro Tyr Gly Arg Arg Gin Gly Asp Leu Met Glu Ser Leu 
545 550 555 560 

Glu Gin Arg Ala He Asp Leu Tyr Lys Gin Leu Lys His Arg Pro Ser 

565 570 575 

Asp His Ser Tyr Ser Asp Ser Thr Glu Met Val Lys He He Val His 

580 585 590 

Thr Val Gin Ser Gin Asp Arg Val Leu Lys Glu Leu Phe Gly His Leu 

595 600 605 

Ser Lys Leu Leu Gly Cys Lys Gin Lys He lie Asp Leu Leu Pro Lys 

610 615 620 

Val Glu Val Ala Leu Ser Asn lie Lys Glu Ala Asp Asn Thr Val Met 
625 630 635 640 

Phe Met Gin Gly Lys Arg Gin Lys Glu lie Trp His Leu Leu Lys He 

645 650 655 

Ala Cys Thr Gin Ser Ser Ala Arg Ser Leu Val Gly Ser Ser Leu Glu 

660 665 670 

Gly Ala Val Thr Pro Gin Thr Ser Ala Trp Leu Pro Pro Thr Ser Ala 

675 680 685 

Glu His Asp His Ser Leu Ser Cys Val Val Thr Pro Gin Asp Gly Glu 

690 695 700 

Thr Ser Ala Gin Met He Glu Glu Asn Leu Asn Cys Leu Gly His Leu 
705 710 715 720 

Ser Thr lie lie His Glu Ala Asn Glu Glu Gin Gly Asn Ser Met Met 

725 730 735 

Asn Leu Asp Trp Ser Trp Leu Thr Glu Trp Val Pro Arg Ala Arg Asp 

740 745 750 

Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly 



755 760 765 

Val Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys 

770 775 780 

Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu 
785 790 795 800 

Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro 

805 810 815 

Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr 

820 825 830 

Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu 

835 840 845 

Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr 

850 855 860 

Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg 
865 870 875 880 

He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly 

885 890 895 

His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala 

900 905 910 

Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn 

915 920 925 

He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr 

930 935 940 

Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser 
945 950 955 960 

Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 

965 970 975 

Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp 

980 985 990 

Glu Leu Tyr Lys 
995 

(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1908 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/ KEY: Coding Sequence 

(B) LOCATION: 1 . . .1905 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 48 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 



GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 
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Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 240 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 2 88 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 3 36 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 480 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 57 6 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 720 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT CGA GCT CAA GCT TCC ATG AGC GAG ACG GTC ATC ATG 7 68 

Gly Leu Arg Ser Arg Ala Gin Ala Ser Met Ser Glu Thr Val He Met 
245 250 255 

A'GC GAG ACG GTC ATC TGT TCC AGC CGG GCC ACT GTG ATG CTT TAT GAT 816 
Ser Glu Thr Val He Cys Ser Ser Arg Ala Thr Val Met Leu Tyr Asp 
260 265 270 



GAT GGC AAC AAG CGA TGG CTC CCT GCT GGC ACG GGT CCC CAG GCC TTC 864 
Asp Gly Asn Lys Arg Trp Leu Pro Ala Gly Thr Gly Pro Gin Ala Phe 
275 280 285 

AGC CGC GTC CAG ATC TAC CAC AAC CCC ACG GCC AAT TCC TTT CGC GTC 912 
Ser Arg Val Gin lie Tyr His Asn Pro Thr Ala Asn Ser Phe Arg Val 
290 295 300 

GTG GGC CGG AAG ATG CAG CCC GAC CAG CAG GTG GTC ATC AAC TGT GCC 960 
Val Gly Arg Lys Met Gin Pro Asp Gin Gin Val Val He Asn Cys Ala 
305 310 315 320 

ATC GTC CGG GGT GTC AAG TAT AAC CAG GCC ACC CCC AAC TTC CAT CAG 1008 
He Val Arg Gly Val Lys Tyr Asn Gin Ala Thr Pro Asn Phe His Gin 
325 330 335 

TGG CGC GAC GCT CGC CAG GTC TGG GGC CTC AAC TTC GGC AGC AAG GAG 1056 
Trp Arg Asp Ala Arg Gin Val Trp Gly Leu Asn Phe Gly Ser Lys Glu 
340 345 350 

GAT GCG GCC CAG TTT GCC GCC GGC ATG GCC AGT GCC CTA GAG GCG TTG 1104 
Asp Ala Ala Gin Phe Ala Ala Gly Met Ala Ser Ala Leu Glu Ala Leu 
355 360 365 

GAA GGA GGT GGG CCC CCT CCA CCC CCA GCA CTT CCC ACC TGG TCG GTC 1152 
Glu Gly Gly Gly Pro Pro Pro Pro Pro Ala Leu Pro Thr Trp Ser Val 
370 375 380 

CCG AAC GGC CCC TCC CCG GAG GAG GTG GAG CAG CAG AAA AGG CAG CAG 1200 
Pro Asn Gly Pro Ser Pro Glu Glu Val Glu Gin Gin Lys Arg Gin Gin 
385 390 395 400 

CCC GGC CCG TCG GAG CAC ATA GAG CGC CGG GTC TCC AAT GCA GGA GGC 1248 
Pro Gly Pro Ser Glu His He Glu Arg Arg Val Ser Asn Ala Gly Gly 
405 410 415 

CCA CCT GCT CCC CCC GCT GGG GGT CCA CCC CCA CCA CCA GGA CCT CCC 1296 
Pro Pro Ala Pro Pro Ala Gly Gly Pro Pro Pro Pro Pro Gly Pro Pro 
420 425 430 

CCT CCT CCA GGT CCC CCC CCA CCC CCA GGT TTG CCC CCT TCG GGG GTC 1344 
Pro Pro Pro Gly Pro Pro Pro Pro Pro Gly Leu Pro Pro Ser Gly Val 
435 440 445 

CCA GCT GCA GCG CAC GGA GCA GGG GGA GGA CCA CCC CCT GCA CCC CCT 13 92 

Pro Ala Ala Ala His Gly Ala Gly Gly Gly Pro Pro Pro Ala Pro Pro 
450 455 460 

CTC CCG GCA GCA CAG GGC CCT GGT GGT GGG GGA GCT GGG GCC CCA GGC 1440 
Leu Pro Ala Ala Gin Gly Pro Gly Gly Gly Gly Ala Gly Ala Pro Gly 
465 470 475 480 

CTG GCC GCA GCT ATT GCT GGA GCC AAA CTC AGG AAA GTC AGC AAG CAG 14 88 
Leu Ala Ala Ala He Ala Gly Ala Lys Leu Arg Lys Val Ser Lys Gin 
485 490 495 

GAG GAG GCC TCA GGG GGG CCC ACA GCC CCC AAA GCT GAG AGT GGT CGA 1536 



Glu Glu Ala Ser Gly Gly Fro Thr Ala Pro Lys Ala Glu Ser Gly Arg 
500 505 510 

AGC GGA GGT GGG GGA CTC ATG GAA GAG ATG AAC GCC ATG CTG GCC CGG 1584 
Ser Gly Gly Gly Gly Leu Met Glu Glu Met Asn Ala Met Leu Ala Arg 
515 520 525 

AGA AGG AAA GCC ACG CAA GTT GGG GAG AAA ACC CCC AAG GAT GAA TCT 1632 
Arg Arg Lys Ala Thr Gin Val Gly Glu Lys Thr Pro Lys Asp Glu Ser 
530 535 540 

GCC AAT CAG GAG GAG CCA GAG GCC AGA GTC CCG GCC CAG AGT GAA TCT 16 80 
Ala Asn Gin Glu Glu Pro Glu Ala Arg Val Pro Ala Gin Ser Glu Ser 
545 550 555 560 

GTG CGG AGA CCC TGG GAG AAG AAC AGC ACA ACC TTG CCA AGG ATG AAG 1728 
Val Arg Arg Pro Trp Glu Lys Asn Ser Thr Thr Leu Pro Arg Met Lys 
565 570 575 

TCG TCT TCT TCG GTG ACC ACT TCC GAG ACC CAA CCC TGC ACG CCC AGC 177 6 

Ser Ser Ser Ser Val Thr Thr Ser Glu Thr Gin Pro Cys Thr Pro Ser 
580 585 590 

TCC AGT GAT TAC TCG GAC CTA CAG AGG GTG AAA CAG GAG CTT CTG GAA 1824 
Ser Ser Asp Tyr Ser Asp Leu Gin Arg Val Lys Gin Glu Leu Leu Glu 
595 600 605 

GAG GTG AAG AAG GAA TTG CAG AAA GTG AAA GAG GAA ATC ATT GAA GCC 1872 
Glu Val Lys Lys Glu Leu Gin Lys Val Lys Glu Glu He He Glu Ala 
610 615 620 

TTC GTC CAG GAG CTG AGG AAG CGG GGT TCT CCC TGA 1908 
Phe Val Gin Glu Leu Arg Lys Arg Gly Ser Pro 
625 630 635 



(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 



65 

Gin His Asp Phe 

Arg Thr lie Phe 
100 

Val Lys Phe Glu 
115 

lie Asp Phe Lys 
130 

Asn Tyr Asn Ser 
145 

Gly lie Lys Val 

Val Gin Leu Ala 
180 

Pro Val Leu Leu 
195 

Ser Lys Asp Pro 
210 

Val Thr Ala Ala 
225 

Gly Leu Arg Ser 

Ser Glu Thr Val 
260 

Asp Gly Asn Lys 
275 

Ser Arg Val Gin 
290 

Val Gly Arg Lys 
305 

He Val Arg Gly 

Trp Arg Asp Ala 
340 

Asp Ala Ala Gin 
355 

Glu Gly Gly Gly 
370 

Pro Asn Gly Pro 

385 

Pro Gly Pro Ser 

Pro Pro Ala Pro 
420 

Pro Pro Pro Gly 
435 

Pro Ala Ala Ala 
450 

Leu Pro Ala Ala 
465 

Leu Ala Ala Ala 

Glu Glu Ala Ser 
500 

Ser Gly Gly Gly 
515 

Arg Arg Lys Ala 



70 

Phe Lys Ser Ala 
85 

Phe Lys Asp Asp 

Gly Asp Thr Leu 
120 

Glu Asp Gly Asn 
135 

His Asn Val Tyr 
150 

Asn Phe Lys He 
165 

Asp His Tyr Gin 

Pro Asp Asn His 
200 

Asn Glu Lys Arg 
215 

Gly He Thr Leu 
230 

Arg Ala Gin Ala 
245 

He Cys Ser Ser 

Arg Trp Leu Pro 
280 

He Tyr His Asn 
295 

Met Gin Pro Asp 
310 

Val Lys Tyr Asn 
325 

Arg Gin Val Trp 

Phe Ala Ala Gly 
360 

Pro Pro Pro Pro 
375 

Ser Pro Glu Glu 
390 

Glu His He Glu 
405 

Pro Ala Gly Gly 

Pro Pro Pro Pro 
440 

His Gly Ala Gly 
455 

Gin Gly Pro Gly 
470 

He Ala Gly Ala 
485 

Gly Gly Pro Thr 

Gly Leu Met Glu 
520 

Thr Gin Val Gly 
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75 

Met Pro Glu Gly 
90 

Gly Asn Tyr Lys 
105 

Val Asn Arg He 

He Leu Gly His 
140 

He Met Ala Asp 
155 

Arg His Asn He 
170 

Gin Asn Thr Pro 
185 

Tyr Leu Ser Thr 

Asp His Met Val 
220 

Gly Met Asp Glu 
235 

Ser Met Ser Glu 
250 

Arg Ala Thr Val 
265 

Ala Gly Thr Gly 

Pro Thr Ala Asn 
300 

Gin Gin Val Val 
315 

Gin Ala Thr Pro 
330 

Gly Leu Asn Phe 
345 

Met Ala Ser Ala 

Pro Ala Leu Pro 
380 

Val Glu Gin Gin 
395 

Arg Arg Val Ser 
410 

Pro Pro Pro Pro 
425 

Pro Gly Leu Pro 

Gly Gly Pro Pro 
460 

Gly Gly Gly Ala 
475 

Lys Leu Arg Lys 
490 

Ala Pro Lys Ala 
505 

Glu Met Asn Ala 
Glu Lys Thr Pro 



80 

Tyr Val Gin Glu 
95 

Thr Arg Ala Glu 
110 

Glu Leu Lys Gly 
125 

Lys Leu Glu Tyr 

Lys Gin Lys Asn 
160 

Glu Asp Gly Ser 
175 

He Gly Asp Gly 
190 

Gin Ser Ala Leu 
205 

Leu Leu Glu Phe 

Leu Tyr Lys Ser 
240 

Thr Val He Met 
255 

Met Leu Tyr Asp 
270 

Pro Gin Ala Phe 
285 

Ser Phe Arg Val 

He Asn Cys Ala 
320 

Asn Phe His Gin 
335 

Gly Ser Lys Glu 
350 

Leu Glu Ala Leu 
365 

Thr Trp Ser Val 

Lys Arg Gin Gin 
400 

Asn Ala Gly Gly 
415 

Pro Gly Pro Pro 
430 

Pro Ser Gly Val 
445 

Pro Ala Pro Pro 

Gly Ala Pro Gly 
480 

Val Ser Lys Gin 
495 

Glu Ser Gly Arg 
510 

Met Leu Ala Arg 

525 

Lys Asp Glu Ser 



530 535 540 

Ala Asn Gin Glu Glu Pro Glu Ala Arg Val Pro Ala Gin Ser Glu Ser 
545 550 555 560 

Val Arg Arg Pro Trp Glu Lys Asn Ser Thr Thr Leu Pro Arg Met Lys 

565 570 575 

Ser Ser Ser Ser Val Thr Thr Ser Glu Thr Gin Pro Cys Thr Pro Ser 

580 585 590 

Ser Ser Asp Tyr Ser Asp Leu Gin Arg Val Lys Gin Glu Leu Leu Glu 

595 600 605 

Glu Val Lys Lys Glu Leu Gin Lys Val Lys Glu Glu lie lie Glu Ala 

610 615 620 

Phe Val Gin Glu Leu Arg Lys Arg Gly Ser Pro 
625 630 635 

(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
<ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 1...1326 
(D) OTHER INFORMATION: 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 4 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

TCC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 2 40 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 2 8E 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 3 36 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 HO 



GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 480 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 52 8 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 57 6 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 62 4 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 67 2 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 720 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT CGA GCT CAA GCT TCA ATG GCT GCC ATC CGG AAG AAA 7 68 

Gly Leu Arg Ser Arg Ala Gin Ala Ser Met Ala Ala He Arg Lys Lys 
245 250 255 

CTG GTG ATT GTT GGT GAT GGA GCC TGT GGA AAG AC A TGC TTG CTC ATA 816 
Leu Val He Val Gly Asp Gly Ala Cys Gly Lys Thr Cys Leu Leu He 
260 265 270 

GTC TTC AGC AAG GAC CAG TTC CCA GAG GTG TAT GTG CCC ACA GTG TTT 864 
Val Phe Ser Lys Asp Gin Phe Pro Glu Val Tyr Val Pro Thr Val Phe 
275 280 285 

GAG AAC TAT GTG GCA GAT ATC GAG GTG GAT GGA AAG CAG GTA GAG TTG 912 
Glu Asn Tyr Val Ala Asp He Glu Val Asp Gly Lys Gin Val Glu Leu 
290 295 300 

GCT TTG TGG GAC ACA GCT GGG CAG GAA GAT TAT GAT CGC CTG AGG CCC 960 
Ala Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr Asp Arg Leu Arg Pro 
305 310 315 320 

CTC TCC TAC CCA GAT ACC GAT GTT ATA CTG ATG TGT TTT TCC ATC GAC 1008 
Leu Ser Tyr Pro Asp Thr Asp Val lie Leu Met Cys Phe Ser He Asp 
325 330 335 

AGC CCT GAT AGT TTA GAA AAC ATC CCA GAA AAG TGG ACC CCA GAA GTC 10 56 



Ser Pro Asp 



AAG CAT TTC 
Lys His Phe 
355 

GAT CTT CGG 
Asp Leu Arg 
370 

CAG GAG CCG 

Gin Glu Pro 
385 

GGC GCT TTT 

Gly Ala Phe 



AGA GAG GTT 
Arg Glu Val 



GGG AAG AAA 
Gly Lys Lys 
435 



Ser Leu Glu 
340 

TGT CCC AAC 
Cys Pro Asn 

AAT GAT GAG 
Asn Asp Glu 



GTG AAA CCT 
Val Lys Pro 
390 

GGG TAC ATG 
Gly Tyr Met 
405 

TTT GAA ATG 
Phe Glu Met 
420 

AAA TCT GGT 
Lys Ser Gly 



Asn lie Pro 
345 

GTG CCC ATC 
Val Pro He 
360 

CAC ACA AGG 
His Thr Arg 
375 

GAA GAA GGC 
Glu Glu Gly 



GAG TGT TCA 
Glu Cys Ser 



GCT ACG AGA 
Ala Thr Arg 
425 

TGC CTT GTC 
Cys Leu Val 
440 



Glu Lys Trp 



ATC CTG GTT 
He Leu Val 



CGG GAG CTA 
Arg Glu Leu 
380 

AGA GAT ATG 
Arg Asp Met 
395 

GCA AAG ACC 
Ala Lys Thr 
410 

GCT GCT CTG 
Ala Ala Leu 



TTG TGA 
Leu 



Thr Pro Glu 
350 

GGG AAT AAG 
Gly Asn Lys 
365 

GCC AAG ATG 
Ala Lys Met 



GCA AAC AGG 
Ala Asn Arg 



AAA GAT GGA 
Lys Asp Gly 
415 

CAA GCT AGA 
Gin Ala Arg 



Val 



AAG 1104 
Lys 

AAG 1152 
Lys 



ATT 12 00 

He 

400 

GTG 1248 
Val 



CGT 1296 
Arg 



1329 



(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 442 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:127: 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

115 120 125 

He Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 



130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Arg Ala Gin Ala Ser Met Ala Ala He Arg Lys Lys 

245 250 255 

Leu Val He Val Gly Asp Gly Ala Cys Gly Lys Thr Cys Leu Leu He 

260 265 270 

Val Phe Ser Lys Asp Gin Phe Pro Glu Val Tyr Val Pro Thr Val Phe 

275 280 285 

Glu Asn Tyr Val Ala Asp He Glu Val Asp Gly Lys Gin Val Glu Leu 

290 295 300 

Ala Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr Asp Arg Leu Arg Pro 
305 310 315 320 

Leu Ser Tyr Pro Asp Thr Asp Val He Leu Met Cys Phe Ser He Asp 

325 330 335 

Ser Pro Asp Ser Leu Glu Asn He Pro Glu Lys Trp Thr Pro Glu Val 

340 345 350 

Lys His Phe Cys Pro Asn Val Pro He He Leu Val Gly Asn Lys Lys 

355 360 365 

Asp Leu Arg Asn Asp Glu His Thr Arg Arg Glu Leu Ala Lys Met Lys 

370 375 380 

Gin Glu Pro Val Lys Pro Glu Glu Gly Arg Asp Met Ala Asn Arg He 
385 390 395 400 

Gly Ala Phe Gly Tyr Met Glu Cys Ser Ala Lys Thr Lys Asp Gly Val 

405 410 415 

Arg Glu Val Phe Glu Met Ala Thr Arg Ala Ala Leu Gin Ala Arg Arg 

420 425 430 

Gly Lys Lys Lys Ser Gly Cys Leu Val Leu 
435 440 

(2} INFORMATION FOR SEQ ID NO: 12 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1140 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s i ng 1 e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 1...1137 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 



ATG GAC CAT TAT GAT TCT CAG CAA ACC AAC GAT TAC ATG CAG CCA GAA 



Met Asp His Tyr Asp Ser Gin Gin Thr Asn Asp Tyr Met Gin Pro Glu 
1 5 10 15 

GAG GAC TGG GAC CGG GAC CTG CTC CTG GAC CCG GCC TGG GAG AAG CAG 96 
Glu Asp Trp Asp Arg Asp Leu Leu Leu Asp Pro Ala Trp Glu Lys Gin 
20 25 30 

CAG AGA AAG ACA TTC ACG GCA TGG TGT AAC TCC CAC CTC CGG AAG GCG 144 
Gin Arg Lys Thr Phe Thr Ala Trp Cys Asn Ser His Leu Arg Lys Ala 
35 40 45 

GGG ACA CAG ATC GAG AAC ATC GAA GAG GAC TTC CGG GAT GGC CTG AAG 192 
Gly Thr Gin He Glu Asn He Glu Glu Asp Phe Arg Asp Gly Leu Lys 
50 55 60 

CTC ATG CTG CTG CTG GAG GTC ATC TCA GGT GAA CGC TTG GCC AAG CCA 240 
Leu Met Leu Leu Leu Glu Val He Ser Gly Glu Arg Leu Ala Lys Pro 
65 70 75 80 

GAG CGA GGC AAG ATG AGA GTG CAC AAG ATC TCC AAC GTC AAC AAG GCC 2 88 

Glu Arg Gly Lys Met Arg Val His Lys He Ser Asn Val Asn Lys Ala 
85 90 95 

CTG GAT TTC ATA GCC AGC AAA GGC GTC AAA CTG GTG TCC ATC GGA GCC 3 36 

Leu Asp Phe He Ala Ser Lys Gly Val Lys Leu Val Ser He Gly Ala 
100 105 HO 

GAA GAA ATC GTG GAT GGG AAT GTG AAG ATG ACC CTG GGC ATG ATC TGG 3 84 

Glu Glu He Val Asp Gly Asn Val Lys Met Thr Leu Gly Met He Trp 
115 120 125 

ACC ATC ATC CTG CGC AGG GAT CCA CCG GTC GCC ACC ATG GTG AGC AAG 432 
Thr He He Leu Arg Arg Asp Pro Pro Val Ala Thr Met Val Ser Lys 
130 135 140 

GGC GAG GAG CTG TTC ACC GGG GTC GTG CCC ATC CTG GTC GAG CTG GAC 4 80 

Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu Leu Asp 
145 150 155 160 

GGC GAC GTA AAC G5C CAC AAG TTC AGC GTG TCC GGC GAG GGC GAG GGC 528 
Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly 
165 170 175 

GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC TGC ACC ACC GGC 57 6 

Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr Thr Gly 
180 185 190 

AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC CTG ACC TAC GGC 62 4 

Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly 
195 200 205 

GTG CAG TG2 TTC AG 2 CGC TAC CCC GAC CAC ATG AAG CAG CAC GAC TTC 67 2 

Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp Phe 
210 215 220 

TTC AAG TCC GCC ATG CCC GAA G3C TAC GTC CAG GAG CGC ACC ATC TTC 72 0 

Phe Lys Ser Ala Met Pre Glu Gly Tyr Val Gin Glu Arg Thr He Phe 
225 230 235 240 



TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG GTG AAG TTC GAG 7 68 

Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu 
245 250 255 

GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC ATC GAC TTC AAG 816 
Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie Asp Phe Lys 
260 265 270 

GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC AAC TAC AAC AGC 864 
Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser 
275 280 285 

CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC GGC ATC AAG GTG 912 
His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn Gly lie Lys Val 
290 295 300 

AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC GTG CAG CTC GCC 960 
Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser Val Gin Leu Ala 
305 310 315 320 

GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC CCC GTG CTG CTG 1008 
Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly Pro Val Leu Leu 
325 330 335 

CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG AGC AAA GAC CCC 1056 
Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro 
340 345 350 

AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC GTG ACC GCC GCC 1104 
Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala 
355 360 365 

GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TAA 1140 
Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
370 375 



(2) INFORMATION FOR SEQ ID NO: 12 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 379 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

Met Asp His Tyr Asp Ser Gin Gin Thr Asn Asp Tyr Met Gin Pro Glu 

15 10 15 

Glu Asp Trp Asp Arg Asp Leu Leu Leu Asp Pro Ala Trp Glu Lys Gin 

20 25 30 

Gin Arg Lys Thr Phe Thr Ala Trp Cys Asn Ser His Leu Arg Lys Ala 

35 40 45 

Gly Thr Gin lie Glu Asn lie Glu Glu Asp Phe Arg Asp Gly Leu Lys 



50 55 6C 

Leu Met Leu Leu Leu Glu Val lie Ser Gly Glu Arg Leu Ala Lys Pro 
65 70 75 80 

Glu Arg Gly Lys Met Arg Val His Lys lie Ser Asn Val Asn Lys Ala 

85 90 95 

Leu Asp Phe lie Ala Ser Lys Gly Val Lys Leu Val Ser lie Gly Ala 

100 105 110 

Glu Glu He Val Asp Gly Asn Val Lys Met Thr Leu Gly Met He Trp 

115 120 125 

Thr He He Leu Arg Arg Asp Pro Pro Val Ala Thr Met Val Ser Lys 

130 135 140 

Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu Leu Asp 
145 150 155 160 

Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly 

165 170 175 

Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys Thr Thr Gly 

180 185 190 

Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly 

195 200 205 

Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp Phe 

210 215 220 

Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr He Phe 
225 230 235 240 

Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu 

245 250 255 

Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie Asp Phe Lys 

260 265 270 

Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser 

275 280 285 

His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn Gly lie Lys Val 

290 295 300 

Asn Phe Lys He Arg His Asn lie Glu Asp Gly Ser Val Gin Leu Ala 
305 310 315 320 

Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu 

325 330 335 

Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro 

340 345 350 

Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala 

355 360 365 

Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
370 375 

(2) INFORMATION FOR SEQ ID NO:130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME /KEY: Coding Sequence 

(B) LOCATION: 1...3513 
<D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 3 0 : 



/<?<r> 



ATG GT3 AGC AAG GGC GAG GAG CTG TT: ACC GGG GTG GTG CCC ATC CTG 4 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

GAG GO: GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 240 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 288 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 336 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 432 
lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 480 
Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 720 



Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT CGA GCC ATG AAC GCC CCC GAG CGG CAG CCC CAA CCC 7 68 

Gly Leu Arg Ser Arg Ala Met Asn Ala Pro Glu Arg Gin Pro Gin Pro 
245 250 255 

GAC GGC GGG GAC GCC CCA GGC CAC GAG CCT GGG GGC AGC CCC CAA GAC 816 
Asp Gly Gly Asp Ala Pro Gly His Glu Pro Gly Gly Ser Pro Gin Asp 
260 265 270 

GAG CTT GAC TTC TCC ATC CTC TTC GAC TAT GAG TAT TTG AAT CCG AAC 864 
Glu Leu Asp Phe Ser He Leu Phe Asp Tyr Glu Tyr Leu Asn Pro Asn 
275 280 285 

GAA GAA GAG CCG AAT GCA CAT AAG GTC GCC AGC CCA CCC TCC GGA CCC 912 
Glu Glu Glu Pro Asn Ala His Lys Val Ala Ser Pro Pro Ser Gly Pro 
290 295 300 

GCA TAC CCC GAT GAT GTA ATG GAC TAT GGC CTC AAG CCA TAC AGC CCC 960 
Ala Tyr Pro Asp Asp Val Met Asp Tyr Gly Leu Lys Pro Tyr Ser Pro 
305 310 315 320 

CTT GCT AGT CTC TCT GGC GAG CCC CCC GGC CGA TTC GGA GAG CCG GAT 1008 
Leu Ala Ser Leu Ser Gly Glu Pro Pro Gly Arg Phe Gly Glu Pro Asp 
325 330 335 

AGG GTA GGG CCG CAG AAG TTT CTG AGC GCG GCC AAG CCA GCA GGG GCC 1056 
Arg Val Gly Pro Gin Lys Phe Leu Ser Ala Ala Lys Pro Ala Gly Ala 
340 345 350 

TOG GGC CTG AGC CCT CGG ATC GAG ATC ACT CCG TCC CAC GAA CTG ATC 1104 
Ser Gly Leu Ser Pro Arg He Glu He Thr Pro Ser His Glu Leu He 
355 360 365 

CAG GCA GTG GGG CCC CTC CGC ATG AGA GAC GCG GGC CTC CTG GTG GAG 1152 
Gin Ala Val Gly Pro Leu Arg Met Arg Asp Ala Gly Leu Leu Val Glu 
370 375 380 

CAG CCT CCC CTG GCC GGG GTG GCC GCC AGC CCG AGG TTC ACC CTG CCC 12 00 
Gin Pro Pro Leu Ala Gly Val Ala Ala Ser Pro Arg Phe Thr Leu Pro 
385 390 395 400 

GTG CCC GGC TTC GAG GGC TAC CGC GAG CCG CTT TGC TTG AGC CCC GCT 12 48 
Val Pro Gly Phe Glu Gly Tyr Arg Glu Pro Leu Cys Leu Ser Pro Ala 
405 410 415 

A'^r AG: GGC TCC TCT GCC AGC TTC ATT TCT GAC ACC TTC TCC CCC TAC 12 96 

Ser Ser Gly Ser Ser Ala Ser Phe He Ser Asp Thr Phe Ser Pro Tyr 
420 425 430 

ACC TCG CCC TGC GTC TCG CCC AAT AAC GGC GGG CCC GAC GAC CTG TGT 1344 
Thr Ser Pro Cys Val Ser Pro Asn Asn Gly Gly Pro Asp Asp Leu Cys 
435 440 445 

CCG CAG TTT CAA AAC ATC CCT GCT CAT TAT TCC CCC AGA ACC TCG CCA 13 92 
Pro Gin Phe Gin Asn He Pro Ala His Tyr Ser Pro Arg Thr Ser Pro 
450 455 460 



ATA ATG TCA CCT CGA ACC AGO CTC GCC GAG GAC AGO TGC CTG GGC CGC 144 0 

He Met Ser Pro Arg Thr Ser Leu Ala Glu Asp Ser Cys Leu Gly Arg 
465 470 475 480 

CAC TCG CCG GTG CCC CGT CCG GCC TCC CGC TCC TCA TCG CCT GGT GCC 14 88 

His Ser Pro Val Pro Arg Pro Ala Ser Arg Ser Ser Ser Pro Gly Ala 
485 490 495 

AAG CGG AGG CAT TCG TGC GCC GAG GCC TTG GTT GCC CTG CCG CCC GGA 153 6 
Lys Arg Arg His Ser Cys Ala Glu Ala Leu Val Ala Leu Pro Pro Gly 
500 505 510 

GCC TCA CCC CAG CGC TCC CGG AGC CCC TCG CCG CAG CCC TCA TCT CAC 15 84 

Ala Ser Pro Gin Arg Ser Arg Ser Pro Ser Pro Gin Pro Ser Ser His 
515 520 525 

GTG GCA CCC CAG GAC CAC GGC TCC CCG GCT GGG TAC CCC CCT GTG GCT 1632 
Val Ala Pro Gin Asp His Gly Ser Pro Ala Gly Tyr Pro Pro Val Ala 
530 535 540 

GGC TCT GCC GTG ATC ATG GAT GCC CTG AAC AGC CTC GCC ACG GAC TCG 16 80 

Gly Ser Ala Val He Met Asp Ala Leu Asn Ser Leu Ala Thr Asp Ser 
545 550 555 560 

CCT TGT GGG ATC CCC CCC AAG ATG TGG AAG ACC AGC CCT GAC CCC TCG 1728 
Pro Cys Gly He Pro Pro Lys Met Trp Lys Thr Ser Pro Asp Pro Ser 
565 570 575 

CCG GTG TCT GCC GCC CCA TCC AAG GCC GGC CTG CCT CGC CAC ATC TAC 1776 
Pro Val Ser Ala Ala Pro Ser Lys Ala Gly Leu Pro Arg His He Tyr 
580 585 590 

CCG GCC GTG GAG TTC CTG GGG CCC TGC GAG CAG GGC GAG AGG AGA AAC 1824 
Pro Ala Val Glu Phe Leu Gly Pro Cys Glu Gin Gly Glu Arg Arg Asn 
595 600 605 

TCG GCT CCA GAA TCC ATC CTG CTG GTT CCG CCC ACT TGG CCC AAG CCG 1872 
Ser Ala Pro Glu Ser He Leu Leu Val Pro Pro Thr Trp Pro Lys Pro 
610 615 620 

CTG GTG CCT GCC ATT CCC ATC T3C AGC ATC CCA GTG ACT GCA TCC CTC 1920 
Leu Val Pro Ala He Pro He Cys Ser He Pro Val Thr Ala Ser Leu 
625 630 635 640 

CCT CCA CTT GAG TGG CCG CTG TCC AGT CAG TCA GGC TCT TAC GAG CTG 196 8 

Pro Pro Leu Glu Trp Pro Leu Ser Ser Gin Ser Gly Ser Tyr Glu Leu 
645 650 655 

CGG ATC GAG GTG CAG CCC AAG CCA CAT CAC CGG GCC CAC TAT GAG ACA 2016 
Arg He Glu Val Gin Pro Lys Pro His His Arg Ala His Tyr Glu Thr 
660 665 670 

GAA GGC AGC CGA GGG GCT GTC AAA GCT CCA ACT GGA GGC CAC CCT GTG 2064 
Glu Gly Ser Arg Gly Ala Val Lys Ala Pro Thr Gly Gly His Pro Val 
675 680 685 

GTT CAG CTC CAT GGC TAC ATG GAA AAC AAG CCT CT3 GGA CTT CAG ATC 2112 



Val Gin Leu His Gly Tyr Met Glu Asn Lys Pro Leu Gly Leu Gin lie 
690 69S 700 

TTC ATT GGG AC A GCT GAT GAG CGG ATC CTT AAG CCG CAC GCC TTC TAC 2160 
Phe He Gly Thr Ala Asp Glu Arg He Leu Lys Pro His Ala Phe Tyr 
705 710 715 720 

CAG GTG CAC CGA ATC ACG GGG AAA ACT GTC ACC ACC ACC AGC TAT GAG 22 08 
Gin Val His Arg He Thr Gly Lys Thr Val Thr Thr Thr Ser Tyr Glu 
725 730 735 

AAG ATA GTG GGC AAC ACC AAA GTC CTG GAG ATC CCC TTG GAG CCC AAA 2256 
Lys He Val Gly Asn Thr Lys Val Leu Glu He Pro Leu Glu Pro Lys 
740 745 750 

AAC AAC ATG AGG GCA ACC ATC GAC TGT GCG GGG ATC TTG AAG CTT AGA 2304 
Asn Asn Met Arg Ala Thr He Asp Cys Ala Gly He Leu Lys Leu Arg 
755 760 765 

AAC GCC GAC ATT GAG CTG CGG AAA GGC GAG ACG GAC ATT GGA AGA AAG 2352 
Asn Ala Asp He Glu Leu Arg Lys Gly Glu Thr Asp lie Gly Arg Lys 
770 775 780 

AAC ACG CGG GTG AGA CTG GTT TTC CGA GTT CAC ATC CCA GAG TCC AGT 24 00 
Asn Thr Arg Val Arg Leu Val Phe Arg Val His He Pro Glu Ser Ser 
785 790 795 800 

GGC AGA ATC GTC TCT TTA CAG ACT GCA TCT AAC CCC ATC GAG TGC TCC 24 48 
Gly Arg He Val Ser Leu Gin Thr Ala Ser Asn Pro He Glu Cys Ser 
805 810 815 

CAG CGA TCT GCT CAC GAG CTG CCC ATG GTT GAA AGA CAA GAC AC A GAC 24 96 
Gin Arg Ser Ala His Glu Leu Pro Met Val Glu Arg Gin Asp Thr Asp 
820 825 830 

AGC TGC CTG GTC TAT GGC GGC CAG CAA ATG ATC CTC ACG GGG CAG AAC 2 5 44 
Ser Cys Leu Val Tyr Gly Gly Gin Gin Met lie Leu Thr Gly Gin Asn 
835 840 845 

TTT ACA TCC GAG TCC AAA GTT GTG TTT ACT GAG AAG ACC ACA GAT GGA 2592 
Phe Thr Ser Glu Ser Lys Val Val Phe Thr Glu Lys Thr Thr Asp Gly 
850 855 860 

CAG CAA ATT TGG GAG ATG GAA GCC ACG GTG GAT AAG GAC AAG AGC CAG 2 640 
Gin Gin He Trp Glu Met Glu Ala Thr Val Asp Lys Asp Lys Ser Gin 
865 870 875 880 

CCC AAC ATG CTT TTT GTT GAG ATC CCT GAA TAT CGG AAC AAG CAT ATC 2 688 
Pro Asn Met Leu Phe Val Glu He Pro Glu Tyr Arg Asn Lys His He 
885 890 895 

CGC ACA CCT GTA AAA GTG AAC TTC TAC GTC ATC AAT GGG AAG AGA AAA 27 3 6 
Arg Thr Pro Val Lys Val Asn Phe Tyr Val He Asn Gly Lys Arg Lys 
900 905 910 

CGA AGT CAG CCT CAG CAC TTT ACC TAC CAC CCA GTC CCA GCC ATC AAG 27 84 
Arg Ser Gin Pro Gin His Phe Thr Tyr His Pro Val Pro Ala lie Lys 
915 920 925 



ACG GAG CCC ACG GAT GAA TAT GAC GCC ACT CTG ATC TGC AGO CCC ACC 2832 

Thr Glu Pre Thr Asp Glu Tyr Asp Pro Thr Leu lie Cys Ser Pro Thr 
930 935 940 

CAT GGA GGC CTG GGG AGC CAG CCT TAC TAC CCC CAG CAC CCG ATG GTG 2 880 

His Gly Gly Leu Gly Ser Gin Pro Tyr Tyr Pro Gin His Pro Met Val 

945 950 955 960 

GCC GAG TCC CCC TCC TGC CTC GTG GCC ACC ATG GCT CCC TGC CAG CAG 292 8 

Ala Glu Ser Pro Ser Cys Leu Val Ala Thr Met Ala Pro Cys Gin Gin 
965 970 975 

TTC CGC ACG GGG CTC TCA TCC CCT GAC GCC CGC TAC CAG CAA CAG AAC 2976 

Phe Arg Thr Gly Leu Ser Ser Pro Asp Ala Arg Tyr Gin Gin Gin Asn 
980 985 990 

CCA GCG GCC GTA CTC TAC CAG CGG AGC AAG AGC CTG AGC CCC AGC CTG 3024 

Pro Ala Ala Val Leu Tyr Gin Arg Ser Lys Ser Leu Ser Pro Ser Leu 
995 1000 1005 

CTG GGC TAT CAG CAG CCG GCC CTC ATG GCC GCC CCG CTG TCC CTT GCG 3072 

Leu Gly Tyr Gin Gin Pro Ala Leu Met Ala Ala Pro Leu Ser Leu Ala 
1010 1015 1020 

GAC GCT CAC CGC TCT GTG CTG GTG CAC GCC GGC TCC CAG GGC CAG AGC 312 0 

Asp Ala His Arg Ser Val Leu Val His Ala Gly Ser Gin Gly Gin Ser 

1025 1030 1035 1040 

TCA GCC CTG CTC CAC CCC TCT CCG ACC AAC CAG CAG GCC TCG CCT GTG 3168 

Ser Ala Leu Leu His Pro Ser Pro Thr Asn Gin Gin Ala Ser Pro Val 
1045 1050 1055 

ATC CAC TAC TCA CCC ACC AAC CAG CAG CTG CGC TGC GGA AGC CAC CAG 3216 

lie His Tyr Ser Pro Thr Asn Gin Gin Leu Arg Cys Gly Ser His Gin 
1060 1065 1070 

GAG TTC CAG CAC ATC ATG TAC TGC GAG AAT TTC GCA CCA GGC ACC ACC 32 64 

Glu Phe Gin His lie Met Tyr Cys Glu Asn Phe Ala Pro Gly Thr Thr 
1075 1080 1085 

AGA CCT GGC CCG CCC CCG GTC AGT CAA GGT CAG AGG CTG AGC CCG GGT 3312 

Arg Pro Gly Pro Pro Pro Val Ser Gin Gly Gin Arg Leu Ser Pro Gly 
1090 1095 1100 

TCC TAC CCC AC A GTC ATT CAG CAG CAG AAT GCC ACG AGC CAA AGA GCC 33 60 

Ser Tyr Pro Thr Val lie Gin Gin Gin Asn Ala Thr Ser Gin Arg Ala 

1105 1110 1115 1120 

GCC AAA AAC GGA CCC CCG GTC AGT GAC CAA AAG GAA GTA TTA CCT GCG 34 08 

Ala Lys Asn Gly Pro Pro Val Ser Asp Gin Lys Glu Val Leu Pro Ala 
1125 1130 1135 

GGG GTG ACC ATT AAA CAG GAG CAG AAC TTG GAC CAG ACC TAC TTG GAT 3456 

Gly Val Thr lie Lys Gin Glu Gin Asn Leu Asp Gin Thr Tyr Leu Asp 
1140 1145 1150 

GAT GTT AAT GAA ATT ATC AGG AAG GAG TTT TCA GGA CCT CCT GCC AGA 3 504 



Asp Val Asn Glu lie He Arg Lys Glu Phe Ser Gly Pro Pro Ala Arg 
1155 1160 1165 

AAT CAG ACG TAA 3516 
Asn Gin Thr 
1170 



(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1171 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

115 120 125 

lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arc Asp His Met Val Leu Leu Glu Phe 

210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Arg Ala Met Asn Ala Pro Glu Arg Gin Pro Gin Pro 

245 250 255 

Asp Gly Gly Asp Ala Pro Gly His Glu Pro Gly Gly Ser Pro Gin Asp 

260 265 270 

Glu Leu Asp Phe Ser He Leu Phe Asp Tyr Glu Tyr Leu Asn Pro Asn 

275 280 285 

Glu Glu Glu Pro Asn Ala His Lys Val Ala Ser Pro Pro Ser Gly Pro 



290 295 300 

Ala Tyr Pro Asp Asp Val Met Asp Tyr Gly Leu Lys Pro Tyr Ser Pro 
305 310 315 320 

Leu Ala Ser Leu Ser Gly Glu Pro Pro Gly Arg Phe Gly Glu Pro Asp 

325 330 335 

Arg Val Gly Pro Gin Lys Phe Leu Ser Ala Ala Lys Pro Ala Gly Ala 

340 345 350 

Ser Gly Leu Ser Pro Arg He Glu He Thr Pro Ser His Glu Leu He 

355 360 365 

Gin Ala Val Gly Pro Leu Arg Met Arg Asp Ala Gly Leu Leu Val Glu 

370 375 380 

Gin Pro Pro Leu Ala Gly Val Ala Ala Ser Pro Arg Phe Thr Leu Pre 
385 390 395 400 

Val Pro Gly Phe Glu Gly Tyr Arg Glu Pro Leu Cys Leu Ser Pro Ala 

405 410 415 

Ser Ser Gly Ser Ser Ala Ser Phe He Ser Asp Thr Phe Ser Pro Tyr 

420 425 430 

Thr Ser Pro Cys Val Ser Pro Asn Asn Gly Gly Pro Asp Asp Leu Cys 

435 440 445 

Pro Gin Phe Gin Asn He Pro Ala His Tyr Ser Pro Arg Thr Ser Pro 

450 455 460 

He Met Ser Pro Arg Thr Ser Leu Ala Glu Asp Ser Cys Leu Gly Arg 
465 470 475 480 

His Ser Pro Val Pro Arg Pro Ala Ser Arg Ser Ser Ser Pro Gly Ala 

485 490 495 

Lys Arg Arg His Ser Cys Ala Glu Ala Leu Val Ala Leu Pro Pro Gly 

500 505 510 

Ala Ser Pro Gin Arg Ser Arg Ser Pro Ser Pro Gin Pro Ser Ser His 

515 520 525 

Val Ala Pro Gin Asp His Gly Ser Pro Ala Gly Tyr Pro Pro Val Ala 

530 535 540 

Gly Ser Ala Val lie Met Asp Ala Leu Asn Ser Leu Ala Thr Asp Ser 
545 550 555 560 

Pro Cys Gly He Pro Pro Lys Met Trp Lys Thr Ser Pro Asp Pro Ser 

565 570 575 

Pro Val Ser Ala Ala Pro Ser Lys Ala Gly Leu Pro Arg His He Tyr 

580 585 590 

Pro Ala Val Glu Phe Leu Gly Pro Cys Glu Gin Gly Glu Arg Arg Asn 

595 600 605 

Ser Ala Pro Glu Ser He Leu Leu Val Pro Pro Thr Trp Pro Lys Pro 

610 615 620 

Leu Val Pro Ala He Pro He Cys Ser He Pro Val Thr Ala Ser Leu 
625 630 635 640 

Pro Pro Leu Glu Trp Pro Leu Ser Ser Gin Ser Gly Ser Tyr Glu Leu 

645 650 655 

Arg He Glu Val Gin Pro Lys Pro His His Arg Ala His Tyr Glu Thr 

660 665 670 

Glu Gly Ser Arg Gly Ala Val Lys Ala Pro Thr Gly Gly His Pro Val 

675 680 685 

Val Gin Leu His Gly Tyr Met Glu Asn Lys Pro Leu Gly Leu Gin He 

690 695 700 

Phe He Gly Thr Ala Asp Glu Arg He Leu Lys Pro His Ala Phe Tyr 
705 710 715 720 

Gin Val His Arg He Thr Gly Lys Thr Val Thr Thr Thr Ser Tyr Glu 

725 730 735 

Lys He Val Gly Asn Thr Lys Val Leu Glu lie Pro Leu Glu Pro Lys 

740 745 750 

Asn Asn Met Arg Ala Thr He Asp Cys Ala Gly He Leu Lys Leu Arg 



755 760 765 

Asn Ala Asp He Glu Leu Arg Lys Gly Glu Thr Asp He Gly Arg Lys 

770 775 780 

Asn Thr Arg Val Arg Leu Val Phe Arg Val His He Pro Glu Ser Ser 
785 790 795 800 

Gly Arg He Val Ser Leu Gin Thr Ala Ser Asn Pro He Glu Cys Ser 

805 810 815 

Gin Arg Ser Ala His Glu Leu Pro Met Val Glu Arg Gin Asp Thr Asp 

820 825 830 

Ser Cys Leu Val Tyr Gly Gly Gin Gin Met He Leu Thr Gly Gin Asn 

835 840 845 

Phe Thr Ser Glu Ser Lys Val Val Phe Thr Glu Lys Thr Thr Asp Gly 

850 855 860 

Gin Gin He Trp Glu Met Glu Ala Thr Val Asp Lys Asp Lys Ser Gin 
865 870 875 880 

Pro Asn Met Leu Phe Val Glu He Pro Glu Tyr Arg Asn Lys His He 

885 890 895 

Arg Thr Pro Val Lys Val Asn Phe Tyr Val lie Asn Gly Lys Arg Lys 

900 905 910 

Arg Ser Gin Pro Gin His Phe Thr Tyr His Pro Val Pro Ala He Lys 

915 920 925 

Thr Glu Pro Thr Asp Glu Tyr Asp Pro Thr Leu He Cys Ser Pro Thr 

930 935 940 

His Gly Gly Leu Gly Ser Gin Pro Tyr Tyr Pro Gin His Pro Met Val 
945 950 955 960 

Ala Glu Ser Pro Ser Cys Leu Val Ala Thr Met Ala Pro Cys Gin Gin 

965 970 975 

Phe Arg Thr Gly Leu Ser Ser Pro Asp Ala Arg Tyr Gin Gin Gin Asn 

980 985 990 

Fro Ala Ala Val Leu Tyr Gin Arg Ser Lys Ser Leu Ser Pro Ser Leu 

995 1000 1005 

Leu Gly Tyr Gin Gin Pro Ala Leu Met Ala Ala Pro Leu Ser Leu Ala 

1010 1015 1020 

Asp Ala His Arg Ser Val Leu Val His Ala Gly Ser Gin Gly Gin Ser 
025 1030 1035 1040 

Ser Ala Leu Leu His Pro Ser Pro Thr Asn Gin Gin Ala Ser Pro Val 

1045 1050 1055 

He His Tyr Ser Pro Thr Asn Gin Gin Leu Arg Cys Gly Ser His Gin 

1060 1065 1070 

Glu Phe Gin His He Met Tyr Cys Glu Asn Phe Ala Pro Gly Thr Thr 

1075 1080 1085 

Arg Pro Gly Pro Pro Pro Val Ser Gin Gly Gin Arg Leu Ser Pro Gly 

1090 1095 1100 

Ser Tyr Pro Thr Val lie Gin Gin Gin Asn Ala Thr Ser Gin Arg Ala 
105 1110 1115 1120 

Ala Lys Asn Gly Pro Pro Val Ser Asp Gin Lys Glu Val Leu Pro Ala 

1125 1130 1135 

Gly Val Thr lie Lys Gin Glu Gin Asn Leu Asp Gin Thr Tyr Leu Asp 

1140 1145 1150 

Asp Val Asn Glu lie lie Arg Lys Glu Phe Ser Gly Pro Pro Ala Arg 

1155 1160 1165 

Asn Gin Thr 
1170 

(2) INFORMATION FOR SEQ ID NO: 132: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 3546 base pairs 



(B) TYPE: nucleic acid 

( C ) STRAND EDNES S : s i ng 1 e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDMA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. . .3543 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

ATG AAC GCC CCC GAG CGG CAG CCC CAA CCC GAC GGC GGG GAC GCC CCA 48 
Met Asn Ala Pro Glu Arg Gin Pro Gin Pro Asp Gly Gly Asp Ala Pro 
15 10 15 

GGC CAC GAG CCT GGG GGC AGC CCC CAA GAC GAG CTT GAC TTC TCC ATC 96 
Gly His Glu Pro Gly Gly Ser Pro Gin Asp Glu Leu Asp Phe Ser lie 
20 25 30 

CTC TTC GAC TAT GAG TAT TTG AAT CCG AAC GAA GAA GAG CCG AAT GCA 144 
Leu Phe Asp Tyr Glu Tyr Leu Asn Pro Asn Glu Glu Glu Pro Asn Ala 
35 40 45 

CAT AAG GTC GCC AGC CCA CCC TCC GGA CCC GCA TAC CCC GAT GAT GTA 192 
His Lys Val Ala Ser Pro Pro Ser Gly Pro Ala Tyr Pro Asp Asp Val 
50 55 60 

ATG GAC TAT GGC CTC AAG CCA TAC AGC CCC CTT GCT AGT CTC TCT GGC 240 
Met Asp Tyr Gly Leu Lys Pro Tyr Ser Pro Leu Ala Ser Leu Ser Gly 
65 70 75 80 

GAG CCC CCC GGC CGA TTC GGA GAG CCG GAT AGG GTA GGG CCG CAG AAG 288 
Glu Pro Pro Gly Arg Phe Gly Glu Pro Asp Arg Val Gly Pro Gin Lys 
85 90 95 

TTT CTG AGC GCG GCC AAG CCA GCA GGG GCC TCG GGC CTG AGC CCT CGG 33 6 

Phe Leu Ser Ala Ala Lys Pro Ala Gly Ala Ser Gly Leu Ser Pro Arg 
100 105 110 

ATC GAG ATC ACT CCG TCC CAC GAA CTG ATC CAG GCA GTG GGG CCC CTC 384 
lie Glu He Thr Pro Ser His Glu Leu He Gin Ala Val Gly Pro Leu 
115 120 125 

CGC ATG AGA GAC GCG GGC CTC CTG GTG GAG CAG CCT CCC CTG GCC GGG 432 
Arg Met Arg Asp Ala Gly Leu Leu Val Glu Gin Pro Pro Leu Ala Gly 
130 135 140 

GTG GCC GCC AGC CCG AGG TTC ACC CTG CCC GTG CCC GGC TTC GAG GGC 480 
Val Ala Ala Ser Pro Arg Phe Thr Leu Pro Val Pro Gly Phe Glu Gly 
145 150 155 160 

TAC CGC GAG CCG CTT TGC TTG AGC CCC GCT AGC AGC GGC TCC TCT GCC 528 
Tyr Arg Glu Pro Leu Cys Leu Ser Pro Ala Ser Ser Gly Ser Ser Ala 
165 170 175 

AGC TTC ATT TCT GAC ACC TTC TCC CCC TAC ACC TCG CCC TGC GTC TCG 576 



/9 9 



Ser Phe lie Ser Asp Thr Phe Ser Pro Tyr Thr Ser Pro Cys Val Ser 
180 185 190 

CCC AAT AAC GGC GGG CCC GAC GAC CTG TGT CCG CAG TTT CAA AAC ATC 624 
Pro Asn Asn Gly Gly Pro Asp Asp Leu Cys Pro Gin Phe Gin Asn lie 
195 200 205 

CCT GCT CAT TAT TCC CCC AGA ACC TCG CCA ATA ATG TCA CCT CGA ACC 672 
Pro Ala His Tyr Ser Pro Arg Thr Ser Pro He Met Ser Pro Arg Thr 
210 215 220 

AGC CTC GCC GAG GAC AGC TGC CTG GGC CGC CAC TCG CCC GTG CCC CGT 720 
Ser Leu Ala Glu Asp Ser Cys Leu Gly Arg His Ser Pro Val Pro Arg 
225 230 235 240 

CCG GCC TCC CGC TCC TCA TCG CCT GGT GCC AAG CGG AGG CAT TCG TGC 7 68 

Pro Ala Ser Arg Ser Ser Ser Pro Gly Ala Lys Arg Arg His Ser Cys 
245 250 255 

GCC GAG GCC TTG GTT GCC CTG CCG CCC GGA GCC TCA CCC CAG CGC TCC 816 
Ala Glu Ala Leu Val Ala Leu Pro Pro Gly Ala Ser Pro Gin Arg Ser 
260 265 270 

CGG AGC CCC TCG CCG CAG CCC TCA TCT CAC GTG GCA CCC CAG GAC CAC 864 
Arg Ser Pro Ser Pro Gin Pro Ser Ser His Val Ala Pro Gin Asp His 
275 280 285 

GGC TCC CCG GCT GGG TAC CCC CCT GTG GCT GGC TCT GCC GTG ATC ATG 912 
Gly Ser Pro Ala Gly Tyr Pro Pro Val Ala Gly Ser Ala Val He Met 
290 295 300 

GAT GCC CTG AAC AGC CTC GCC ACG GAC TCG CCT TGT GGG ATC CCC CCC 960 
Asp Ala Leu Asn Ser Leu Ala Thr Asp Ser Pro Cys Gly He Pro Pro 
305 310 315 320 

AAG ATG TGG AAG ACC AGC CCT GAC CCC TCG CCG GTG TCT GCC GCC CCA 1008 
Lys Met Trp Lys Thr Ser Pro Asp Pro Ser Pro Val Ser Ala Ala Pro 
325 330 335 

TCC AAG GCC GGC CTG CCT CGC CAC ATC TAC CCG GCC GTG GAG TTC CTG 10 56 

Ser Lys Ala Gly Leu Pro Arg His He Tyr Pro Ala Val Glu Phe Leu 
340 345 350 

GGG CCC TGC GAG CAG GGC GAG AGG AGA AAC TCG GCT CCA GAA TCC ATC 1104 
Gly Pro Cys Glu Gin Gly Glu Arg Arg Asn Ser Ala Pro Glu Ser He 
355 360 365 

CTG CTG GTT CCG CCC ACT TGG CCC AAG CCG CTG GTG CCT GCC ATT CCC 1152 
Leu Leu Val Pro Pro Thr Trp Pro Lys Pro Leu Val Pro Ala He Pro 
370 375 380 

ATC TGC AGC ATC CCA. GTG ACT GCA TCC CTC CCT CCA CTT GAG TGG CCG 12 00 

He Cys Ser He Pro Val Thr Ala Ser Leu Pro Pro Leu Glu Trp Pro 
385 390 395 400 

CTG TCC AGT CAG TCA GGC TCT TAC GAG CTG CGG ATC GAG GTG CAG CCC 12 4 8 

Leu Ser Ser Gin Ser Gly Ser Tyr Glu Leu Arg He Glu Val Gin Pro 
405 410 415 



J?CPO 



AAG CCA CAT CAC CGG GCC CAC TAT GAG ACA GAA GGC AGC CGA GGG GCT 1296 

Lys Pro His His Arg Ala His Tyr Glu Thr Glu Gly Ser Arg Gly Ala 
420 425 430 

GTC AAA GCT CCA ACT GGA GGC CAC CCT GTG GTT CAG CTC CAT GGC TAC 1344 

Val Lys Ala Pro Thr Gly Gly His Pro Val Val Gin Leu His Gly Tyr 
435 440 445 

ATG GAA AAC AAG CCT CTG GGA CTT CAG ATC TTC ATT GGG ACA GCT GAT 13 92 

Met Glu Asn Lys Pro Leu Gly Leu Gin lie Phe He Gly Thr Ala Asp 
450 455 460 

GAG CGG ATC CTT AAG CCG CAC GCC TTC TAC CAG GTG CAC CGA ATC ACG 1440 

Glu Arg He Leu Lys Pro His Ala Phe Tyr Gin Val His Arg He Thr 
465 470 475 480 

GGG AAA ACT GTC ACC ACC ACC AGC TAT GAG AAG ATA GTG GGC AAC ACC 1488 

Gly Lys Thr Val Thr Thr Thr Ser Tyr Glu Lys He Val Gly Asn Thr 

485 490 495 

AAA GTC CTG GAG ATC CCC TTG GAG CCC AAA AAC AAC ATG AGG GGA ACC 1536 

Lys Val Leu Glu He Pro Leu Glu Pro Lys Asn Asn Met Arg Ala Thr 
500 505 510 

ATC GAC TGT GCG GGG ATC TTG AAG CTT AGA AAC GCC GAC ATT GAG CTG 1584 

lie Asp Cys Ala Gly He Leu Lys Leu Arg Asn Ala Asp He Glu Leu 
515 520 525 

CGG AAA GGC GAG ACG GAC ATT GGA AGA AAG AAC ACG CGG GTG AGA CTG 1632 

Arg Lys Gly Glu Thr Asp He Gly Arg Lys Asn Thr Arg Val Arg Leu 
530 535 540 

GTT TTC CGA GTT CAC ATC CCA GAG TCC AGT GGC AGA ATC GTC TCT TTA 1680 

Val Phe Arg Val His He Pro Glu Ser Ser Gly Arg He Val Ser Leu 
545 550 555 560 

CAG ACT GCA TCT AAC CCC ATC GAG TGC TCC CAG CGA TCT GCT CAC GAG 1728 

Gin Thr Ala Ser Asn Pro He Glu Cys Ser Gin Arg Ser Ala His Glu 

565 570 575 

CTG CCC ATG GTT GAA AGA CAA GAC ACA GAC AGC TGC CTG GTC TAT GGC 1776 

Leu Pro Met Val Glu Arg Gin Asp Thr Asp Ser Cys Leu Val Tyr Gly 
580 585 590 

GGC CAG CAA ATG ATC CTC ACG GGG CAG AAC TTT ACA TCC GAG TCC AAA 1824 

Gly Gin Gin Met He Leu Thr Gly Gin Asn Phe Thr Ser Glu Ser Lys 
595 600 605 

GTT GTG TTT ACT GAG AAG ACC ACA GAT GGA CAG CAA ATT TGG GAG ATG 1872 

Val Val Phe Thr Glu Lys Thr Thr Asp Gly Gin Gin He Trp Glu Met 
610 615 620 

GAA GCC ACG GTG GAT AAG GAC AAG AGC CAG CCC AAC ATG CTT TTT GTT 192 0 

Glu Ala Thr Val Asp Lys Asp Lys Ser Gin Pro Asn Met Leu Phe Val 
625 630 635 640 

GAG ATC CCT GAA TAT CGG AAC AAG CAT ATC CGC ACA CCT GTA AAA GTG 1968 



Glu He Pro Glu Tyr Arg Asn Lys His He Arg Thr Pro Val Lys Val 
645 650 655 

AAC TTC TAC GTC ATC AAT GGG AAG AGA AAA CGA AGT CAG CCT CAG CAC 2016 
Asn Phe Tyr Val He Asn Gly Lys Arg Lys Arg Ser Gin Pro Gin His 
660 665 670 

TTT ACC TAC CAC CCA GTC CCA GCC ATC AAG ACG GAG CCC ACG GAT GAA 2064 
Phe Thr Tyr His Pro Val Pro Ala He Lys Thr Glu Pro Thr Asp Glu 
675 680 685 

TAT GAC CCC ACT CTG ATC TGC AGC CCC ACC CAT GGA GGC CTG GGG AGC 2112 
Tyr Asp Pro Thr Leu He Cys Ser Pro Thr His Gly Gly Leu Gly Ser 
690 695 700 

CAG CCT TAC TAC CCC CAG CAC CCG ATG GTG GCC GAG TCC CCC TCC TGC 2160 
Gin Pro Tyr Tyr Pro Gin His Pro Met Val Ala Glu Ser Pro Ser Cys 
705 710 715 720 

CTC GTG GCC ACC ATG GCT CCC TGC CAG CAG TTC CGC ACG GGG CTC TCA 2208 
Leu Val Ala Thr Met Ala Pro Cys Gin Gin Phe Arg Thr Gly Leu Ser 
725 730 735 

TCC CCT GAC GCC CGC TAC CAG CAA CAG AAC CCA GCG GCC GTA CTC TAC 2256 
Ser Pro Asp Ala Arg Tyr Gin Gin Gin Asn Pro Ala Ala Val Leu Tyr 
740 745 750 

CAG CGG AGC AAG AGC CTG AGC CCC AGC CTG CTG GGC TAT CAG CAG CCG 2304 
Gin Arg Ser Lys Ser Leu Ser Pro Ser Leu Leu Gly Tyr Gin Gin Pro 
755 760 765 

GCC CTC ATG GCC GCC CCG CTG TCC CTT GCG GAC GCT CAC CGC TCT GTG 23 52 
Ala Leu Met Ala Ala Pro Leu Ser Leu Ala Asp Ala His Arg Ser Val 
770 775 780 

CTG GTG CAC GCC GGC TCC CAG GGC CAG AGC TCA GCC CTG CTC CAC CCC 2400 
Leu Val His Ala Gly Ser Gin Gly Gin Ser Ser Ala Leu Leu His Pro 
785 790 795 800 

TCT CCG ACC AAC CAG CAG GCC TCG CCT GTG ATC CAC TAC TCA CCC ACC 2448 
Ser Pro Thr Asn Gin Gin Ala Ser Pro Val He His Tyr Ser Pro Thr 
805 810 815 

AAC CAG CAG CTG CGC TGC GGA AGC CAC CAG GAG TTC CAG CAC ATC ATG 2496 
Asn Gin Gin Leu Arg Cys Gly Ser His Gin Glu Phe Gin His lie Met 
820 825 830 

TAC TGC GAG AAT TTC GCA CCA GGC ACC ACC AGA CCT GGC CCG CCC CCG 2544 
Tyr Cys Glu Asn Phe Ala Pro Gly Thr Thr Arg Pro Gly Pro Pro Pro 
835 840 845 

GTC AGT CAA GGT CAG AGG CTG AGZ CCG GGT TCC TAC CCC ACA GTC ATT 2 592 
Val Ser Gin Gly Gin Arg Leu Ser Pro Gly Ser Tyr Pro Thr Val He 
850 855 860 

CAG CAG CAG AAT GCC ACG AGC CAA AGA GCC GCC AAA AAC GGA CCC CCG 2640 
Gin Gin Gin Asn Ala Thr Ser Gin Arg Ala Ala Lys Asn Gly Pro Pro 
865 870 875 880 



2oZ 



GTC AGT GAC CM AAG GAA GTA TTA CCT GCG GGG GTG ACC ATT AAA CAG 2 6 88 
Val Ser Asp Gin Lys Glu Val Leu Pro Ala Gly Val Thr lie Lys Gin 
885 890 895 

GAG CAG AAC TTG GAC CAG ACC TAC TTG GAT GAT GTT AAT GAA ATT ATC 2736 
Glu Gin Asn Leu Asp Gin Thr Tyr Leu Asp Asp Val Asn Glu lie He 
900 905 910 

AGG AAG GAG TTT TCA GGA CCT CCT GCC AGA AAT CAG ACG AGA ATT CTG 27 84 
Arg Lys Glu Phe Ser Gly Pro Pro Ala Arg Asn Gin Thr Arg He Leu 
915 920 925 

CAG TCG ACG GTA CCG CGG GCC CGG GAT CCA CCG GTC GCC ACC ATG GTG 2832 
Gin Ser Thr Val Pro Arg Ala Arg Asp Pro Pro Val Ala Thr Met Val 
930 935 940 

AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG GTC GAG 2880 
Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu 
945 950 955 960 

CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC GAG GGC 2928 
Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly 
965 970 975 

GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC TGC ACC 2976 
Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr 
980 985 990 

ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC CTG ACC 3024 
Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr 
995 1000 1005 

TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG CAG CAC 3072 
Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His 
1010 1015 1020 

GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG CGC ACC 3120 
Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr 
1025 1030 1035 1040 

ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG GTG AAG 3168 
He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys 
1045 1050 1055 

TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC ATC GAC 3216 
Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He Asp 
1060 1065 1070 

TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC AAC TAC 32 64 
Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn Tyr 
1075 1080 1085 

AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC GGC ATC 3 312 
Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly He 
1090 1095 1100 

AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC GTG CAG 3360 
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Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin 
1105 1110 1115 1120 

CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC CCC GTG 3408 
Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val 
1125 1130 1135 

CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG AGC AAA 3456 
Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys 
1140 1145 1150 

GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC GTG ACC 3504 
Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr 
1155 1160 1165 

GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TAA 3546 
Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
1170 1175 1180 



(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1181 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 

Met Asn Ala Pro Glu Arg Gin Pro Gin Pro Asp Gly Gly Asp Ala Pro 

15 10 15 

Gly His Glu Pro Gly Gly Ser Pro Gin Asp Glu Leu Asp Phe Ser He 

20 25 30 

Leu Phe Asp Tyr Glu Tyr Leu Asn Pro Asn Glu Glu Glu Pro Asn Ala 

35 40 45 

His Lys Val Ala Ser Pro Pro Ser Gly Pro Ala Tyr Pro Asp Asp Val 

50 55 60 

Met Asp Tyr Gly Leu Lys Pro Tyr Ser Pro Leu Ala Ser Leu Ser Gly 
65 70 75 80 

Glu Pro Pro Gly Arg Phe Gly Glu Pro Asp Arg Val Gly Pro Gin Lys 

85 90 95 

Phe Leu Ser Ala Ala Lys Pro Ala Gly Ala Ser Gly Leu Ser Pro Arg 

100 105 110 

He Glu He Thr Pro Ser His Glu Leu He Gin Ala Val Gly Pro Leu 

115 120 125 

Arg Met Arg Asp Ala Gly Leu Leu Val Glu Gin Pro Pro Leu Ala Gly 

130 135 140 

Val Ala Ala Ser Pro Arg Phe Thr Leu Pro Val Pro Gly Phe Glu Gly 
145 150 155 160 

Tyr Arg Glu Pro Leu Cys Leu Ser Pro Ala Ser Ser Gly Ser Ser Ala 

165 170 175 

Ser Phe He Ser Asp Thr Phe Ser Pro Tyr Thr Ser Pro Cys Val Ser 

180 185 190 

Pro Asn Asn Gly Gly Pre Asp Asp Leu Cys Pro Gin Phe Gin Asn He 
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195 200 205 

Pro Ala His Tyr Ser Pro Arg Thr Ser Pro lie Met Ser Pro Arg Thr 

210 215 220 

Ser Leu Ala Glu Asp Ser Cys Leu Gly Arg His Ser Pro Val Pro Arg 
225 230 235 240 

Pro Ala Ser Arg Ser Ser Ser Pro Gly Ala Lys Arg Arg His Ser Cys 

245 250 255 

Ala Glu Ala Leu Val Ala Leu Pro Pro Gly Ala Ser Pro Gin Arg Ser 

260 265 270 

Arg Ser Pro Ser Pro Gin Pro Ser Ser His Val Ala Pro Gin Asp His 

275 280 285 

Gly Ser Pro Ala Gly Tyr Pro Pro Val Ala Gly Ser Ala Val lie Met 

290 295 300 

Asp Ala Leu Asn Ser Leu Ala Thr Asp Ser Pro Cys Gly lie Pro Pro 
305 310 315 320 

Lys Met Trp Lys Thr Ser Pro Asp Pro Ser Pro Val Ser Ala Ala Pro 

325 330 335 

Ser Lys Ala Gly Leu Pro Arg His lie Tyr Pro Ala Val Glu Phe Leu 

340 345 350 

Gly Pro Cys Glu Gin Gly Glu Arg Arg Asn Ser Ala Pro Glu Ser lie 

355 360 365 

Leu Leu Val Pro Pro Thr Trp Pro Lys Pro Leu Val Pro Ala lie Pro 

370 375 380 

lie Cys Ser lie Pro Val Thr Ala Ser Leu Pro Pro Leu Glu Trp Pro 
385 390 395 400 

Leu Ser Ser Gin Ser Gly Ser Tyr Glu Leu Arg He Glu Val Gin Pro 

405 410 415 

Lys Pro His His Arg Ala His Tyr Glu Thr Glu Gly Ser Arg Gly Ala 

420 425 430 

Val Lys Ala Pro Thr Gly Gly His Pro Val Val Gin Leu His Gly Tyr 

435 440 445 

Met Glu Asn Lys Pro Leu Gly Leu Gin He Phe He Gly Thr Ala Asp 

450 455 460 

Glu Arg He Leu Lys Pro His Ala Phe Tyr Gin Val His Arg He Thr 
465 470 475 480 

Gly Lys Thr Val Thr Thr Thr Ser Tyr Glu Lys lie Val Gly Asn Thr 

485 490 495 

Lys Val Leu Glu He Pro Leu Glu Pro Lys Asn Asn Met Arg Ala Thr 

500 505 510 

lie Asp Cys Ala Gly lie Leu Lys Leu Arg Asn Ala Asp He Glu Leu 

515 520 525 

Arg Lys Gly Glu Thr Asp He Gly Arg Lys Asn Thr Arg Val Arg Leu 

530 535 540 

Val Phe Arg Val His He Pro Glu Ser Ser Gly Arg He Val Ser Leu 
545 550 555 560 

Gin Thr Ala Ser Asn Pro He Glu Cys Ser Gin Arg Ser Ala His Glu 

565 570 575 

Leu Pro Met Val Glu Arg Gin Asp Thr Asp Ser Cys Leu Val Tyr Gly 

580 585 590 

Gly Gin Gin Met He Leu Thr Gly Gin Asn Phe Thr Ser Glu Ser Lys 

595 600 605 

Val Val Phe Thr Glu Lys Thr Thr Asp Gly Gin Gin He Trp Glu Met 

610 615 620 

Glu Ala Thr Val Asp Lys Asp Lys Ser Gin Pro Asn Met Leu Phe Val 
625 630 635 640 

Glu He Pro Glu Tyr Arg Asn Lys His lie Arg Thr Pro Val Lys Val 

645 650 655 

Asn Phe Tyr Val lie Asn Gly Lys Arg Lys Arg Ser Gin Pro Gin His 



660 665 670 

Phe Thr Tyr His Pro Val Pro Ala lie Lys Thr Glu Pro Thr Asp Glu 

675 680 685 

Tyr Asp Pro Thr Leu lie Cys Ser Pro Thr His Gly Gly Leu Gly Ser 

690 695 700 

Gin Pro Tyr Tyr Pro Gin His Pro Met Val Ala Glu Ser Pro Ser Cys 
705 710 715 720 

Leu Val Ala Thr Met Ala Pro Cys Gin Gin Phe Arg Thr Gly Leu Ser 

725 730 735 

Ser Pro Asp Ala Arg Tyr Gin Gin Gin Asn Pro Ala Ala Val Leu Tyr 

740 745 750 

Gin Arg Ser Lys Ser Leu Ser Pro Ser Leu Leu Gly Tyr Gin Gin Pro 

755 760 765 

Ala Leu Met Ala Ala Pro Leu Ser Leu Ala Asp Ala His Arg Ser Val 

770 775 780 

Leu Val His Ala Gly Ser Gin Gly Gin Ser Ser Ala Leu Leu His Pro 
785 790 795 800 

Ser Pro Thr Asn Gin Gin Ala Ser Pro Val lie His Tyr Ser Pro Thr 

805 810 815 

Asn Gin Gin Leu Arg Cys Gly Ser His Gin Glu Phe Gin His lie Met 

820 825 830 

Tyr Cys Glu Asn Phe Ala Pro Gly Thr Thr Arg Pro Gly Pro Pro Pro 

835 840 845 

Val Ser Gin Gly Gin Arg Leu Ser Pro Gly Ser Tyr Pro Thr Val He 

850 855 860 

Gin Gin Gin Asn Ala Thr Ser Gin Arg Ala Ala Lys Asn Gly Pro Pro 
865 870 875 880 

Val Ser Asp Gin Lys Glu Val Leu Pro Ala Gly Val Thr He Lys Gin 

885 890 895 

Glu Gin Asn Leu Asp Gin Thr Tyr Leu Asp Asp Val Asn Glu He He 

900 905 910 

Arg Lys Glu Phe Ser Gly Pro Pro Ala Arg Asn Gin Thr Arg He Leu 

915 920 925 

Gin Ser Thr Val Pro Arg Ala Arg Asp Pro Pro Val Ala Thr Met Val 

930 935 940 

Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu 
945 950 955 960 

Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly 

965 970 975 

Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr 

980 985 990 

Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr 

995 1000 1005 

Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His 

1010 1015 1020 

Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr 
025 1030 1035 1040 

He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys 

1045 1050 1055 

Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He Asp 

1060 1065 1070 

Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn Tyr 

1075 1080 1085 

Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly He 

1090 1095 1100 

Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin 
105 1110 1115 1120 

Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val 



1125 1130 1135 

Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys 

1140 1145 1150 

Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr 

1155 1160 1165 

Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
1170 1175 1180 

(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2802 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY : Coding Sequence 

(B) LOCATION: 1. . .2799 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 48 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 240 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 2 88 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 3 36 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 432 



lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 480 
Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 52 8 

Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 
165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 
180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 720 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT CGA GGG AGC ATG GGC ACC TTG CGG GAT TTA CAG TAC 7 68 

Gly Leu Arg Ser Arg Gly Ser Met Gly Thr Leu Arg Asp Leu Gin Tyr 
245 250 255 

GCG CTC CAG GAG AAG ATC GAG GAG CTG AGG CAG CGG GAT GCT CTC ATC 816 
Ala Leu Gin Glu Lys He Glu Glu Leu Arg Gin Arg Asp Ala Leu He 
260 265 270 

GAC GAG CTG GAG CTG GAG TTG GAT CAG AAG GAC GAA CTG ATC CAG AAG 864 
Asp Glu Leu Glu Leu Glu Leu Asp Gin Lys Asp Glu Leu He Gin Lys 
275 280 285 

CTG CAG AAC GAG CTG GAC AAG TAC CGC TCG GTG ATC CGA CCA GCC ACC 912 
Leu Gin Asn Glu Leu Asp Lys Tyr Arg Ser Val He Arg Pro Ala Thr 
290 295 300 

CAG CAG GCG CAG AAG CAG AGC GCG AGC ACC TTG CAG GGC GAG CCG CGC 960 
Gin Gin Ala Gin Lys Gin Ser Ala Ser Thr Leu Gin Gly Glu Pro Arg 
305 310 315 320 

ACC AAG CGG CAG GCG ATC TCC GCC GAG CCC ACC GCC TTC GAC ATC CAG 1008 
Thr Lys Arg Gin Ala He Ser Ala Glu Pro Thr Ala Phe Asp He Gin 
325 330 335 

GAT CTC AGC CAT GTG ACC CTG CCC TTC TAC CCC AAG AGC CCA CAG TCC 1056 
Asp Leu Ser His Val Thr Leu Pro Phe Tyr Pro Lys Ser Pro Gin Ser 
340 345 350 

AAG GAT CTT ATA AAG GAA GCT ATC CTT GAC AAT GAC TTT ATG AAG AAC 1104 
Lys Asp Leu He Lys Glu Ala He Leu Asp Asn Asp Phe Met Lys Asn 
355 360 365 



TTG GAG CTG TCG CAG ATC CAG GAG ATT GTG GAT TGT ATG TAC CCG GTG 1152 
Leu Glu Leu Ser Gin lie Gin Glu He Val Asp Cys Met Tyr Pro Val 
370 375 380 

GAG TAT GGC AAG GAC AGT TGC ATC ATC AAA GAA GGA GAC GTG GGG TCA 1200 
Glu Tyr Gly Lys Asp Ser Cys He He Lys Glu Gly Asp Val Gly Ser 
385 390 395 400 

CTG GTG TAT GTC ATG GAA GAT GGT AAG GTT GAA GTT ACA AAA GAA GGT 1248 
Leu Val Tyr Val Met Glu Asp Gly Lys Val Glu Val Thr Lys Glu Gly 
405 410 415 

GTG AAG TTG TGT ACC ATG GGT CCA GGA AAA GTG TTT GGG GAA TTG GCT 1296 
Val Lys Leu Cys Thr Met Gly Pro Gly Lys Val Phe Gly Glu Leu Ala 
420 425 430 

ATT CTT TAC AAC TGT ACC CGG ACA GCG ACC GTC AAG ACT CTT GTA AAT 1344 
He Leu Tyr Asn Cys Thr Arg Thr Ala Thr Val Lys Thr Leu Val Asn 
435 440 445 

GTA AAA CTC TGG GCC ATT GAT CGA CAA TGT TTT CAA ACA ATA ATG ATG 1392 
Val Lys Leu Trp Ala He Asp Arg Gin Cys Phe Gin Thr He Met Met 
450 455 460 

AGG ACA GGA CTC ATC AAG CAT ACC GAG TAT ATG GAA TTT TTA AAA AGC 1440 
Arg Thr Gly Leu He Lys His Thr Glu Tyr Met Glu Phe Leu Lys Ser 
465 470 475 480 

GTT CCA ACA TTC CAG AGC CTT CCT GAA GAG ATC CTC AGC AAG CTT GCT 14 88 
Val Pro Thr Phe Gin Ser Leu Pro Glu Glu He Leu Ser Lys Leu Ala 
485 490 495 

GAT GTC CTT GAA GAG ACC CAC TAT GAA AAT GGA GAA TAT ATT ATC AGG 1536 
Asp Val Leu Glu Glu Thr His Tyr Glu Asn Gly Glu Tyr He He Arg 
500 505 510 

CAA GGT GCA AGA GGG GAC ACC TTC TTT ATC ATC AGC AAA GGA ACG GTA 1584 
Gin Gly Ala Arg Gly Asp Thr Phe Phe He He Ser Lys Gly Thr Val 
515 520 525 

AAT GTC ACT CGT GAA GAC TCA CCG AGT GAA GAC CCA GTC TTT CTT AGA 1632 
Asn Val Thr Arg Glu Asp Ser Pro Ser Glu Asp Pro Val Phe Leu Arg 
530 535 540 

ACT TTA GGA AAA GGA GAC TGG TTT GGA GAG AAA GCC TTG CAG GGG GAA 1680 
Thr Leu Gly Lys Gly Asp Trp Phe Gly Glu Lys Ala Leu Gin Gly Glu 
545 550 555 560 

GAT GTG AGA ACA GCA AAC GTA ATT GCT GCA GAA GCT GTA ACC TGC CTT 1728 
Asp Val Arg Thr Ala Asn Val He Ala Ala Glu Ala Val Thr Cys Leu 
565 570 575 

GTG ATT GAC AGA GAC TCT TTT AAA CAT TTG ATT GGA GGG CTG GAT GAT 1776 
Val He Asp Arg Asp Ser Phe Lys His Leu He Gly Gly Leu Asp Asp 
580 585 590 

GTT TCT AAT AAA GCA TAT GAA GAT GCA GAA GCT AAA GCA AAA TAT GAA 1824 
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Val Ser Asn Lys Ala Tyr Glu Asp Ala Glu Ala Lys Ala Lys Tyr Glu 
595 600 605 

GCT GAA GCG GCT TTC TTC GCC AAC CTG AAG CTG TCT GAT TTC AAC ATC 1872 
Ala Glu Ala Ala Phe Phe Ala Asn Leu Lys Leu Ser Asp Phe Asn lie 
610 615 620 

ATT GAT ACC CTT GGA GTT GGA GGT TTC GGA CGA GTA GAA CTG GTC CAG 1920 
lie Asp Thr Leu Gly Val Gly Gly Phe Gly Arg Val Glu Leu Val Gin 
625 630 635 640 

TTG AAA AGT GAA GAA TCC AAA ACG TTT GCA ATG AAG ATT CTC AAG AAA 1968 
Leu Lys Ser Glu Glu Ser Lys Thr Phe Ala Met Lys lie Leu Lys Lys 
645 650 655 

CGT CAC ATT GTG GAC AC A AGA CAG CAG GAG CAC ATC CGC TCA GAG AAG 2016 
Arg His lie Val Asp Thr Arg Gin Gin Glu His lie Arg Ser Glu Lys 
660 665 670 

CAG ATC ATG CAG GGG GCT CAT TCC GAT TTC ATA GTG AGA CTG TAC AGA 2064 
Gin lie Met Gin Gly Ala His Ser Asp Phe lie Val Arg Leu Tyr Arg 
675 680 685 

ACA TTT AAG GAC AGC AAA TAT TTG TAT ATG TTG ATG GAA GCT TGT CTA 2112 
Thr Phe Lys Asp Ser Lys Tyr Leu Tyr Met Leu Met Glu Ala Cys Leu 
690 695 700 

GGT GGA GAG CTC TGG ACC ATT CTC AGG GAT AGA GGT TCG TTT GAA GAT 2160 
Gly Gly Glu Leu Trp Thr lie Leu Arg Asp Arg Gly Ser Phe Glu Asp 
705 710 715 720 

TCT ACA ACC AGA TTT TAC ACA GCA TGT GTG GTA GAA GCT TTT GCC TAT 2208 
Ser Thr Thr Arg Phe Tyr Thr Ala Cys Val Val Glu Ala Phe Ala Tyr 
725 730 735 

CTG CAT TCC AAA GGA ATC ATT TAC AGG GAC CTC AAG CCA GAA AAT CTC 22 56 
Leu His Ser Lys Gly lie lie Tyr Arg Asp Leu Lys Pro Glu Asn Leu 
740 745 750 

ATC CTA GAT CAC CGA GGT TAT GCC AAA CTG GTT GAT TTT GGC TTT GCA 2304 
lie Leu Asp His Arg Gly Tyr Ala Lys Leu Val Asp Phe Gly Phe Ala 
755 760 765 

AAG AAA ATA GGA TTT GGA AAG AAA ACA TGG ACT TTT TGT GGG ACT CCA 23 52 
Lys Lys He Gly Phe Gly Lys Lys Thr Trp Thr Phe Cys Gly Thr Pro 
770 775 780 

GAG TAT GTA GCC CCA GAG ATC ATC CTG AAC AAA GGC CAT GAC ATT TCA 2400 
Glu Tyr Val Ala Pro Glu He He Leu Asn Lys Gly His Asp He Ser 
785 790 795 800 

GCC GAC TAC TGG TCA CTG GGA ATC CTA ATG TAT GAA CTC CTG ACT GGC 2448 
Ala Asp Tyr Trp Ser Leu Gly He Leu Met Tyr Glu Leu Leu Thr Gly 
805 810 815 

AGC CCA CCT TTC TCA GGC CCA GAT CCT ATG AAA ACC TAT AAC ATC ATA 2496 
Ser Pro Pro Phe Ser Gly Pro Asp Pro Met Lys Thr Tyr Asn He He 
820 825 830 
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TTG AGG GGG ATT GAC ATG ATA GAA TTT CCA AAG AAG ATT GCC AAA AAT 2544 
Leu Arg Gly lie Asp Met He Glu Phe Pro Lys Lys He Ala Lys Asn 
835 840 845 

GCT GCT AAT TTA ATT AAA AAA CTA TGC AGG GAC AAT CCA TCA GAA AGA 2592 
Ala Ala Asn Leu He Lys Lys Leu Cys Arg Asp Asn Pro Ser Glu Arg 
850 855 860 

TTA GGG AAT TTG AAA AAT GGA GTA AAA GAC ATT CAA AAG CAC AAA TGG 2640 
Leu Gly Asn Leu Lys Asn Gly Val Lys Asp He Gin Lys His Lys Trp 
865 870 875 880 

TTT GAG GGC TTT AAC TGG GAA GGC TTA AGA AAA GGT ACC TTG ACA CCT 2688 
Phe Glu Gly Phe Asn Trp Glu Gly Leu Arg Lys Gly Thr Leu Thr Pro 
885 890 895 

CCT ATA ATA CCA AGT GTT GCA TCA CCC ACA GAC ACA AGT AAT TTT GAC 2736 
Pro lie He Pro Ser Val Ala Ser Pro Thr Asp Thr Ser Asn Phe Asp 
900 905 910 

AGT TTC CCT GAG GAC AAC GAT GAA CCA CCA CCT GAT GAC AAC TCA GGA 27 84 
Ser Phe Pro Glu Asp Asn Asp Glu Pro Pro Pro Asp Asp Asn Ser Gly 
915 920 925 

TGG GAT ATA GAC TTC TAA 2 802 

Trp Asp He Asp Phe 
930 



(2) INFORMATION FOR SEQ ID NO: 13 5: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 933 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 HO 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 



115 120 125 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 

130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Arg Gly Ser Met Gly Thr Leu Arg Asp Leu Gin Tyr 

245 250 255 

Ala Leu Gin Glu Lys He Glu Glu Leu Arg Gin Arg Asp Ala Leu He 

260 265 270 

Asp Glu Leu Glu Leu Glu Leu Asp Gin Lys Asp Glu Leu He Gin Lys 

275 280 285 

Leu Gin Asn Glu Leu Asp Lys Tyr Arg Ser Val He Arg Pro Ala Thr 

290 295 300 

Gin Gin Ala Gin Lys Gin Ser Ala Ser Thr Leu Gin Gly Glu Pro Arg 
305 310 315 320 

Thr Lys Arg Gin Ala He Ser Ala Glu Pro Thr Ala Phe Asp He Gin 

325 330 335 

Asp Leu Ser His Val Thr Leu Pro Phe Tyr Pro Lys Ser Pro Gin Ser 

340 345 350 

Lys Asp Leu He Lys Glu Ala He Leu Asp Asn Asp Phe Met Lys Asn 

355 360 365 

Leu Glu Leu Ser Gin He Gin Glu He Val Asp Cys Met Tyr Pro Val 

370 375 380 

Glu Tyr Gly Lys Asp Ser Cys He He Lys Glu Gly Asp Val Gly Ser 
385 390 395 400 

Leu Val Tyr Val Met Glu Asp Gly Lys Val Glu Val Thr Lys Glu Gly 

405 410 415 

Val Lys Leu Cys Thr Met Gly Pro Gly Lys Val Phe Gly Glu Leu Ala 

420 425 430 

He Leu Tyr Asn Cys Thr Arg Thr Ala Thr Val Lys Thr Leu Val Asn 

435 440 445 

Val Lys Leu Trp Ala He Asp Arg Gin Cys Phe Gin Thr He Met Met 

450 455 460 

Arg Thr Gly Leu He Lys His Thr Glu Tyr Met Glu Phe Leu Lys Ser 
465 470 475 480 

Val Pro Thr Phe Gin Ser Leu Pro Glu Glu He Leu Ser Lys Leu Ala 

485 490 495 

Asp Val Leu Glu Glu Thr His Tyr Glu Asn Gly Glu Tyr He He Arg 

500 505 510 

Gin Gly Ala Arg Gly Asp Thr Phe Phe He He Ser Lys Gly Thr Val 

515 520 525 

Asn Val Thr Arg Glu Asp Ser Pro Ser Glu A.sp Pro Val Phe Leu Arg 

530 535 540 

Thr Leu Gly Lys Gly Asp Trp Phe Gly Glu Lys Ala Leu Gin Gly Glu 
545 550 555 560 

Asp Val Arg Thr Ala Asn Val He Ala Ala Glu Ala Val Thr Cys Leu 

565 570 575 

Val He Asp Arg Asp Ser Phe Lys His Leu He Gly Gly Leu Asp Asp 
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580 585 590 

Val Ser Asn Lys Ala Tyr Glu Asp Ala Glu Ala Lys Ala Lys Tyr Glu 

595 600 605 

Ala Glu Ala Ala Phe Phe Ala Asn Leu Lys Leu Ser Asp Phe Asn lie 

610 615 620 

lie Asp Thr Leu Gly Val Gly Gly Phe Gly Arg Val Glu Leu Val Gin 
625 630 635 640 

Leu Lys Ser Glu Glu Ser Lys Thr Phe Ala Met Lys lie Leu Lys Lys 

645 650 655 

Arg His lie Val Asp Thr Arg Gin Gin Glu His He Arg Ser Glu Lys 

660 665 670 

Gin He Met Gin Gly Ala His Ser Asp Phe He Val Arg Leu Tyr Arg 

675 680 685 

Thr Phe Lys Asp Ser Lys Tyr Leu Tyr Met Leu Met Glu Ala Cys Leu 

690 695 700 

Gly Gly Glu Leu Trp Thr He Leu Arg Asp Arg Gly Ser Phe Glu Asp 
705 710 715 720 

Ser Thr Thr Arg Phe Tyr Thr Ala Cys Val Val Glu Ala Phe Ala Tyr 

725 730 735 

Leu His Ser Lys Gly He He Tyr Arg Asp Leu Lys Pro Glu Asn Leu 

740 745 750 

He Leu Asp His Arg Gly Tyr Ala Lys Leu Val Asp Phe Gly Phe Ala 

755 760 765 

Lys Lys He Gly Phe Gly Lys Lys Thr Trp Thr Phe Cys Gly Thr Pro 

770 775 780 

Glu Tyr Val Ala Pro Glu lie He Leu Asn Lys Gly His Asp He Ser 
785 790 795 800 

Ala Asp Tyr Trp Ser Leu Gly He Leu Met Tyr Glu Leu Leu Thr Gly 

805 810 815 

Ser Pro Pro Fhe Ser Gly Pro Asp Pro Met Lys Thr Tyr Asn He lie 

820 825 830 

Leu Arg Gly He Asp Met He Glu Phe Pro Lys Lys He Ala Lys Asn 

835 840 845 

Ala Ala Asn Leu He Lys Lys Leu Cys Arg Asp Asn Pro Ser Glu Arg 

850 855 860 

Leu Gly Asn Leu Lys Asn Gly Val Lys Asp lie Gin Lys His Lys Trp 
865 870 875 880 

Phe Glu Gly Phe Asn Trp Glu Gly Leu Arg Lys Gly Thr Leu Thr Pro 

885 890 895 

Pro He He Pro Ser Val Ala Ser Pro Thr Asp Thr Ser Asn Phe Asp 

900 905 910 

Ser Phe Pro Glu Asp Asn Asp Glu Pro Pro Pro Asp Asp Asn Ser Gly 

915 920 925 

Trp Asp lie Asp Phe 
930 

(2) INFORMATION FOR SEQ ID NO: 13 6: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 2799 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE : 



(A) NAME /KEY : Coding Sequence 



(B) LOCATION: 1...2795 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

ATG GGC ACC TTG CGG GAT TTA CAG TAC GCG CTC CAG GAG AAG ATC GAG 48 
Met Gly Thr Leu Arg Asp Leu Gin Tyr Ala Leu Gin Glu Lys He Glu 
15 10 15 

GAG CTG AGG CAG CGG GAT GCT CTC ATC GAC GAG CTG GAG CTG GAG TTG 96 
Glu Leu Arg Gin Arg Asp Ala Leu He Asp Glu Leu Glu Leu Glu Leu 
20 25 30 

GAT CAG AAG GAC GAA CTG ATC CAG AAG CTG CAG AAC GAG CTG GAC AAG 144 
Asp Gin Lys Asp Glu Leu He Gin Lys Leu Gin Asn Glu Leu Asp Lys 
35 40 45 

TAC CGC TCG GTG ATC CGA CCA GCC ACC CAG CAG GCG CAG AAG CAG AGC 192 
Tyr Arg Ser Val He Arg Pro Ala Thr Gin Gin Ala Gin Lys Gin Ser 
50 55 60 

GCG AGC ACC TTG CAG GGC GAG CCG CGC ACC AAG CGG CAG GCG ATC TCC 240 
Ala Ser Thr Leu Gin Gly Glu Pro Arg Thr Lys Arg Gin Ala He Ser 
65 70 75 80 

GCC GAG CCC ACC GCC TTC GAC ATC CAG GAT CTC AGC CAT GTG ACC CTG 2 88 

Ala Glu Pro Thr Ala Phe Asp He Gin Asp Leu Ser His Val Thr Leu 
85 90 95 

CCC TTC TAC CCC AAG AGC CCA CAG TCC AAG GAT CTT ATA AAG GAA GCT 336 
Pro Phe Tyr Pro Lys Ser Pro Gin Ser Lys Asp Leu He Lys Glu Ala 
100 105 110 

ATC CTT GAC AAT GAC TTT ATG AAG AAC TTG GAG CTG TCG CAG ATC CAG 3 84 

He Leu Asp Asn Asp Phe Met Lys Asn Leu Glu Leu Ser Gin He Gin 
115 120 125 

GAG ATT GTG GAT TGT ATG TAC CCG GTG GAG TAT GGC AAG GAC AGT TGC 432 
Glu He Val Asp Cys Met Tyr Pro Val Glu Tyr Gly Lys Asp Ser Cys 
130 135 140 

ATC ATC AAA GAA GGA GAC GTG GGG TCA CTG GTG TAT GTC ATG GAA GAT 480 
He He Lys Glu Gly Asp Val Gly Ser Leu Val Tyr Val Met Glu Asp 
145 150 155 160 

GGT AAG GTT GAA GTT ACA AAA GAA GGT GTG AAG TTG TGT ACC ATG GGT 528 
Gly Lys Val Glu Val Thr Lys Glu Gly Val Lys Leu Cys Thr Met Gly 
165 170 175 

CCA GGA AAA GTG TTT GGG GAA TTG GCT ATT CTT TAC AAC TGT ACC CGG 576 
Pro Gly Lys Val Phe Gly Glu Leu Ala He Leu Tyr Asn Cys Thr Arg 
180 185 190 

ACA GCG ACC GTC AAG ACT CTT GTA AAT GTA AAA CTC TGG GCC ATT GAT 624 
Thr Ala Thr Val Lys Thr Leu Val Asn Val Lys Leu Trp Ala He Asp 
195 200 205 

CGA CAA TGT TTT CAA ACA ATA ATG ATG AGG ACA GGA CTC ATC AAG CAT 672 



2 / Y 



Arg Gin Cys Phe Gin Thr lie Met Met Arg Thx Gly Leu lie Lys His 
210 215 220 

ACC GAG TAT ATG GAA TTT TTA AAA AGC GTT CCA ACA TTC CAG AGC CTT 720 
Thx Glu Tyr Met Glu Phe Leu Lys Ser Val Pro Thr Phe Gin Ser Leu 
225 230 235 240 

CCT GAA GAG ATC CTC AGC AAG CTT GCT GAT GTC CTT GAA GAG ACC CAC 7 68 

Pro Glu Glu lie Leu Ser Lys Leu Ala Asp Val Leu Glu Glu Thr His 
245 250 255 

TAT GAA AAT GGA GAA TAT ATT ATC AGG CAA GGT GCA AGA GGG GAC ACC 816 
Tyr Glu Asn Gly Glu Tyr lie lie Arg Gin Gly Ala Arg Gly Asp Thr 
260 265 270 

TTC TTT ATC ATC AGC AAA GGA ACG GTA AAT GTC ACT CGT GAA GAC TCA 864 
Phe Phe He He Ser Lys Gly Thr Val Asn Val Thr Arg Glu Asp Ser 
275 280 285 

CCG AGT GAA GAC CCA GTC TTT CTT AGA ACT TTA GGA AAA GGA GAC TGG 912 
Pro Ser Glu Asp Pro Val Phe Leu Arg Thr Leu Gly Lys Gly Asp Trp 
290 295 300 

TTT GGA GAG AAA GCC TTG CAG GGG GAA GAT GTG AGA ACA GCA AAC GTA 960 
Phe Gly Glu Lys Ala Leu Gin Gly Glu Asp Val Arg Thr Ala Asn Val 
305 310 315 320 

ATT GCT GCA GAA GCT GTA ACC TGC CTT GTG ATT GAC AGA GAC TCT TTT 1008 
He Ala Ala Glu Ala Val Thr Cys Leu Val He Asp Arg Asp Ser Phe 
325 330 335 

AAA CAT TTG ATT GGA GGG CTG GAT GAT GTT TCT AAT AAA GCA TAT GAA 1056 
Lys His Leu He Gly Gly Leu Asp Asp Val Ser Asn Lys Ala Tyr Glu 
340 345 350 

GAT GCA GAA GCT AAA GCA AAA TAT GAA GCT GAA GCG GCT TTC TTC GCC 1104 
Asp Ala Glu Ala Lys Ala Lys Tyr Glu Ala Glu Ala Ala Phe Phe Ala 
355 360 365 

AAC CTG AAG CTG TCT GAT TTC AAC ATC ATT GAT ACC CTT GGA GTT GGA 1152 
Asn Leu Lys Leu Ser Asp Phe Asn He He Asp Thr Leu Gly Val Gly 
370 375 380 

GGT TTC GGA CGA GTA GAA CTG GTC CAG TTG AAA AGT GAA GAA TCC AAA 1200 
Gly Phe Gly Arg Val Glu Leu Val Gin Leu Lys Ser Glu Glu Ser Lys 
385 390 395 400 

ACG TTT GCA ATG AAG ATT CTC AAG AAA CGT CAC ATT GTG GAC ACA AGA 124 8 
Thr Phe Ala Met Lys lie Leu Lys Lys Arg His He Val Asp Thr Arg 
405 410 415 

CAG CAG GAG CAC ATC CGC TCA GAG AAG CAG ATC ATG CAG GGG GCT CAT 1296 
Gin Gin Glu His He Arg Ser Glu Lys Gin He Met Gin Gly Ala His 
420 425 430 

TCC GAT TTC ATA GTG AGA CTG TAC AGA ACA TTT AAG GAC AGC AAA TAT 1344 
Ser Asp Phe He Val Arg Leu Tyr Arg Thr Phe Lys Asp Ser Lys Tyr 
435 440 445 



TTG TAT ATG 
Leu Tyr Met 
450 

CTC AGG GAT 
Leu Arg Asp 
465 

GCA TGT GTG 
Ala Cys Val 



TAC AGG GAC 
Tyr Arg Asp 



GCC AAA CTG 
Ala Lys Leu 
515 

AAA ACA TGG 
Lys Thr Trp 
530 

ATC CTG AAC 
lie Leu Asn 
545 

ATC CTA ATG 
lie Leu Met 



GAT CCT ATG 
Asp Pro Met 



GAA TTT CCA 
Glu Phe Pro 
595 

CTA TGC AGG 
Leu Cys Arg 
610 

GTA AAA GAC 
Val Lys Asp 
625 

GGC TTA AGA 
Gly Leu Arg 



TCA CCC ACA 
Ser Pro Thr 



GAA CCA CCA 



TTG ATG GAA 
Leu Met Glu 



AGA GGT TCG 
Arg Gly Ser 
470 

GTA GAA GCT 
Val Glu Ala 
485 

CTC AAG CCA 
Leu Lys Pro 
500 

GTT GAT TTT 
Val Asp Phe 



ACT TTT TGT 
Thr Phe Cys 

AAA GGC CAT 
Lys Gly His 
550 

TAT GAA CTC 
Tyr Glu Leu 
565 

AAA ACC TAT 
Lys Thr Tyr 
580 

AAG AAG ATT 
Lys Lys lie 



GAC AAT CCA 
Asp Asn Pro 



ATT CAA AAG 
lie Gin Lys 
630 

AAA GGT ACC 
Lys Gly Thr 
645 

GAC ACA AGT 
Asp Thr Ser 
660 

CCT GAT GAC 



GCT TGT CTA 
Ala Cys Leu 
455 

TTT GAA GAT 
Phe Glu Asp 



TTT GCC TAT 
Phe Ala Tyr 



GAA AAT CTC 
Glu Asn Leu 
505 

GGC TTT GCA 
Gly Phe Ala 
520 

GGG ACT CCA 
Gly Thr Pro 
535 

GAC ATT TCA 
Asp lie Ser 



CTG ACT GGC 
Leu Thr Gly 



AAC ATC ATA 
Asn lie lie 
585 

GCC AAA AAT 
Ala Lys Asn 
600 

TCA GAA AGA 
Ser Glu Arg 
615 

CAC AAA TGG 
His Lys Trp 



TTG ACA CCT 
Leu Thr Pro 



AAT TTT GAC 
Asn Phe Asp 
665 

AAC TCA GGA 



GGT GGA GAG 
Gly Gly Glu 
460 

TCT ACA ACC 
Ser Thr Thr 
475 

CTG CAT TCC 
Leu His Ser 
490 

ATC CTA GAT 
lie Leu Asp 



AAG AAA ATA 
Lys Lys He 



GAG TAT GTA 
Glu Tyr Val 
540 

GCC GAC TAC 
Ala Asp Tyr 
555 

AGC CCA CCT 
Ser Pro Pro 
570 

TTG AGG GGG 
Leu Arg Gly 



GCT GCT AAT 
Ala Ala Asn 



TTA GGG AAT 
Leu Gly Asn 
620 

TTT GAG GGC 
Phe Glu Gly 
635 

CCT ATA ATA 
Pro He He 
650 

AGT TTC CCT 
Ser Phe Pro 



TGG GAT ATA 



CTC TGG ACC 
Leu Trp Thr 



AGA TTT TAC 
Arg Phe Tyr 



AAA GGA ATC 
Lys Gly He 
495 

CAC CGA GGT 
His Arg Gly 
510 

GGA TTT GGA 
Gly Phe Gly 
525 

GCC CCA GAG 
Ala Pro Glu 



TGG TCA CTG 
Trp Ser Leu 



TTC TCA GGC 
Phe Ser Gly 
575 

ATT GAC ATG 
He Asp Met 
590 

TTA ATT AAA 
Leu He Lys 
605 

TTG AAA AAT 
Leu Lys Asn 

TTT AAC TGG 
Phe Asn Trp 



CCA AGT GTT 
Pro Ser Val 
655 

GAG GAC AAC 
Glu Asp Asn 
670 

GAC TTC TCG 



ATT 1392 
lie 



ACA 1440 

Thr 

480 

ATT 1488 
He 



TAT 1536 
Tyr 



AAG 1584 
Lys 



ATC 1632 
He 



GGA 1680 

Gly 

560 

CCA 1728 
Pro 



ATA 1776 
He 



AAA 1824 
Lys 



GGA 1872 
Gly 



GAA 1920 

Glu 

640 

GCA 1968 
Ala 



GAT 2016 
Asp 



GAT 2064 



Glu Pro Pro Pro Asp Asp Asn Eer Gly Trp Asp lie Asp Phe Ser Asp 
675 680 685 

CCA CCG GTC GCC ACC ATG GTG AGO AAG GGC GAG GAG CTG TTC ACC GGG 2112 
Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly 
690 695 700 

GTG GTG CCC ATC CTG GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG 2160 
Val Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys 
705 710 715 720 

TTC AGC GTG TCC GGC GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG 2208 
Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu 
725 730 735 

ACC CTG AAG TTC ATC TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC 22 56 
Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro 
740 745 750 

ACC CTC GTG ACC ACC CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC 2304 
Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr 
755 760 765 

CCC GAC CAC ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA 23 52 
Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu 
770 775 780 

GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC 2400 
Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr 
785 790 795 800 

AAG ACC CGC GCC GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC 2448 
Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg 
805 810 815 

ATC GAG CTG AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG 2496 
He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly 
820 825 830 

CAC AAG CTG GAG TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC 2544 
His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala 
835 840 845 

GAC AAG CAG AAG AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC 2592 
Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn 
850 855 860 

ATC GAG GAC GGC AGC GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC 2640 
lie Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr 
865 870 875 880 

CCC ATC GGC GAC GGC CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC 2688 
Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser 
885 890 895 

ACC CAG TCC GCC CTG AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG 273 6 
Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 
900 905 910 



.2/? 



GTC CTG CTG GAG TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC 27 84 

Val Leu Leu Glu Phe Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp 
915 920 925 

GAG CTG TAC AA GTAA 279 9 

Glu Leu Tyr Lys 
930 



(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 932 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE : internal 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

Met Gly Thr Leu Arg Asp Leu Gin Tyr Ala Leu Gin Glu Lys lie Glu 

15 10 15 

Glu Leu Arg Gin Arg Asp Ala Leu lie Asp Glu Leu Glu Leu Glu Leu 

20 25 30 

Asp Gin Lys Asp Glu Leu lie Gin Lys Leu Gin Asn Glu Leu Asp Lys 

35 40 45 

Tyr Arg Ser Val lie Arg Pro Ala Thr Gin Gin Ala Gin Lys Gin Ser 

50 55 60 

Ala Ser Thr Leu Gin Gly Glu Pro Arg Thr Lys Arg Gin Ala lie Ser 
65 70 75 80 

Ala Glu Pro Thr Ala Phe Asp lie Gin Asp Leu Ser His Val Thr Leu 

85 90 95 

Pro Phe Tyr Pro Lys Ser Pro Gin Ser Lys Asp Leu lie Lys Glu Ala 

100 105 110 

lie Leu Asp Asn Asp Phe Met Lys Asn Leu Glu Leu Ser Gin lie Gin 

115 120 125 

Glu lie Val Asp Cys Met Tyr Pro Val Glu Tyr Gly Lys Asp Ser Cys 

130 135 140 

He He Lys Glu Gly Asp Val Gly Ser Leu Val Tyr Val Met Glu Asp 
145 150 155 160 

Gly Lys Val Glu Val Thr Lys Glu Gly Val Lys Leu Cys Thr Met Gly 

165 170 175 

Pro Gly Lys Val Phe Gly Glu Leu Ala He Leu Tyr Asn Cys Thr Arg 

180 185 190 

Thr Ala Thr Val Lys Thr Leu Val Asn Val Lys Leu Trp Ala He Asp 

195 200 205 

Arg Gin Cys Phe Gin Thr He Met Met Arg Thr Gly Leu He Lys His 

210 215 220 

Thr Glu Tyr Met Glu Phe Leu Lys Ser Val Pro Thr Phe Gin Ser Leu 
225 230 235 240 

Pro Glu Glu He Leu Ser Lys Leu Ala Asp Val Leu Glu Glu Thr His 

245 250 255 

Tyr Glu Asn Gly Glu Tyr He He Arg Gin Gly Ala Arg Gly Asp Thr 

260 265 270 

Phe Phe lie He Ser Lys Gly Thr Val Asn Val Thr Arg Glu Asp Ser 



2 /cf 



275 280 285 

Pro Ser Glu Asp Pro Val Phe Leu Arg Thr Leu Gly Lys Gly Asp Trp 

290 295 300 

Phe Gly Glu Lys Ala Leu Gin Gly Glu Asp Val Arg Thr Ala Asn Val 
305 310 315 320 

lie Ala Ala Glu Ala Val Thr Cys Leu Val lie Asp Arg Asp Ser Phe 

325 330 335 

Lys His Leu lie Gly Gly Leu Asp Asp Val Ser Asn Lys Ala Tyr Glu 

340 345 350 

Asp Ala Glu Ala Lys Ala Lys Tyr Glu Ala Glu Ala Ala Phe Phe Ala 

355 360 365 

Asn Leu Lys Leu Ser Asp Phe Asn lie lie Asp Thr Leu Gly Val Gly 

370 375 380 

Gly Phe Gly Arg Val Glu Leu Val Gin Leu Lys Ser Glu Glu Ser Lys 
385 390 395 400 

Thr Phe Ala Met Lys lie Leu Lys Lys Arg His lie Val Asp Thr Arg 

405 410 415 

Gin Gin Glu His lie Arg Ser Glu Lys Gin lie Met Gin Gly Ala His 

420 425 430 

Ser Asp Phe lie Val Arg Leu Tyr Arg Thr Phe Lys Asp Ser Lys Tyr 

435 440 445 

Leu Tyr Met Leu Met Glu Ala Cys Leu Gly Gly Glu Leu Trp Thr lie 

450 455 460 

Leu Arg Asp Arg Gly Ser Phe Glu Asp Ser Thr Thr Arg Phe Tyr Thr 
465 470 475 480 

Ala Cys Val Val Glu Ala Phe Ala Tyr Leu His Ser Lys Gly He He 

485 490 495 

Tyr Arg Asp Leu Lys Pro Glu Asn Leu He Leu Asp His Arg Gly Tyr 

500 505 510 

Ala Lys Leu Val Asp Phe Gly Phe Ala Lys Lys He Gly Phe Gly Lys 

515 520 525 

Lys Thr Trp Thr Phe Cys Gly Thr Pro Glu Tyr Val Ala Pro Glu He 

530 535 540 

He Leu Asn Lys Gly His Asp He Ser Ala Asp Tyr Trp Ser Leu Gly 
545 550 555 560 

He Leu Met Tyr Glu Leu Leu Thr Gly Ser Pro Pro Phe Ser Gly Pro 

565 570 575 

Asp Pro Met Lys Thr Tyr Asn He He Leu Arg Gly He Asp Met He 

580 585 590 

Glu Phe Pro Lys Lys lie Ala Lys Asn Ala Ala Asn Leu He Lys Lys 

595 600 605 

Leu Cys Arg Asp Asn Pro Ser Glu Arg Leu Gly Asn Leu Lys Asn Gly 

610 615 620 

Val Lys Asp lie Gin Lys His Lys Trp Phe Glu Gly Phe Asn Trp Glu 
625 630 635 640 

Gly Leu Arg Lys Gly Thr Leu Thr Pro Pro lie lie Pro Ser Val Ala 

645 650 655 

Ser Pro Thr Asp Thr Ser Asn Phe Asp Ser Phe Pro Glu Asp Asn Asp 

660 665 670 

Glu Pro Pro Pro Asp Asp Asn Ser Gly Trp Asp He Asp Phe Ser Asp 

675 680 685 

Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly 

690 695 700 

Val Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys 
705 710 715 720 

Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu 

725 730 735 

Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro 



as? 



740 745 750 

Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr 

755 760 765 

Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu 

770 775 780 

Gly Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr 
785 790 795 800 

Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg 

805 810 815 

lie Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly 

820 825 830 

His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala 

835 840 845 

Asp Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys He Arg His Asn 

850 855 860 

He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr 
865 870 875 880 

Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser 

885 890 895 

Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 

900 905 910 

Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp 

915 920 925 

Glu Leu Tyr Lys 
930 

(2) INFORMATION FOR SEQ ID NO: 13 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2184 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. . .2181 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 48 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 



CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 2 40 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 2 88 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 3 36 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 432 
lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 480 
Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 528 
Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 
165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 57 6 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 
180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 720 
Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT CGA GGC ACC ATG AGC GAC GTG GCT ATT GTG AAG GAG 7 68 

Gly Leu Arg Ser Arg Gly Thr Met Ser Asp Val Ala lie Val Lys Glu 
245 250 255 

GGT TGG CTG CAC AAA CGA GGG GAG TAC ATC AAG ACC TGG CGG CCA CGC 816 
Gly Trp Leu His Lys Arg Gly Glu Tyr lie Lys Thr Trp Arg Pro Arg 
260 265 270 

TAC TTC CTC CTC AAG AAT GAT GGC ACC TTC ATT GGC TAC AAG GAG CGG 864 
Tyr Phe Leu Leu Lys Asn Asp Gly Thr Phe lie Gly Tyr Lys Glu Arg 
275 280 285 

CCG CAG GAT GTG GAC CAA CGT GAG GCT CCC CTC AAC AAC TTC TCT GTG 912 



2 2 f 



Pro Gin Asp Val Asp Gin Arg Glu Ala Pro Leu Asn Asn Phe Ser Val 
290 295 300 

GCG CAG TGC CAG CTG ATG AAG ACG GAG CGG CCC CGG CCC AAC ACC TTC 960 
Ala Gin Cys Gin Leu Met Lys Thr Glu Arg Pro Arg Pro Asn Thr Phe 
305 310 315 320 

ATC ATC CGC TGC CTG CAG TGG ACC ACT GTC ATC GAA CGC ACC TTC CAT 1008 
lie He Arg Cys Leu Gin Trp Thr Thr Val He Glu Arg Thr Phe His 
325 330 335 

GTG GAG ACT CCT GAG GAG CGG GAG GAG TGG ACA ACC GCC ATC CAG ACT 1056 
Val Glu Thr Pro Glu Glu Arg Glu Glu Trp Thr Thr Ala He Gin Thr 
340 345 350 

GTG GCT GAC GGC CTC AAG AAG CAG GAG GAG GAG GAG ATG GAC TTC CGG 1104 
Val Ala Asp Gly Leu Lys Lys Gin Glu Glu Glu Glu Met Asp Phe Arg 
355 360 365 

TCG GGC TCA CCC AGT GAC AAC TCA GGG GCT GAA GAG ATG GAG GTG TCC 1152 
Ser Gly Ser Pro Ser Asp Asn Ser Gly Ala Glu Glu Met Glu Val Ser 
370 375 380 

CTG GCC AAG CCC AAG CAC CGC GTG ACC ATG AAC GAG TTT GAG TAC CTG 1200 
Leu Ala Lys Pro Lys His Arg Val Thr Met Asn Glu Phe Glu Tyr Leu 
385 390 395 400 

AAG CTG CTG GGC AAG GGC ACT TTC GGC AAG GTG ATC CTG GTG AAG GAG 1248 
Lys Leu Leu Gly Lys Gly Thr Phe Gly Lys Val He Leu Val Lys Glu 
405 410 415 

AAG GCC ACA GGC CGC TAC TAC GCC ATG AAG ATC CTC AAG AAG GAA GTC 1296 
Lys Ala Thr Gly Arg Tyr Tyr Ala Met Lys He Leu Lys Lys Glu Val 
420 425 430 

ATC GTG GCC AAG GAC GAG GTG GCC CAC ACA CTC ACC GAG AAC CGC GTC 13 44 
He Val Ala Lys Asp Glu Val Ala His Thr Leu Thr Glu Asn Arg Val 
435 440 445 

CTG CAG AAC TCC AGG CAC CCC TTC CTC ACA GCC CTG AAG TAC TCT TTC 1392 
Leu Gin Asn Ser Arg His Pro Phe Leu Thr Ala Leu Lys Tyr Ser Phe 
450 455 460 

CAG ACC CAC GAC CGC CTC TGC TTT GTC ATG GAG TAC GCC AAC GGG GGC 1440 
Gin Thr His Asp Arg Leu Cys Phe Val Met Glu Tyr Ala Asn Gly Gly 
465 470 475 480 

GAG CTG TTC TTC CAC CTG TCC CGG GAA CGT GTG TTC TCC GAG GAC CGG 1488 
Glu Leu Phe Phe His Leu Ser Arg Glu Arg Val Phe Ser Glu Asp Arg 
485 490 495 

GCC CGC TTC TAT GGC GCT GAG ATT GTG TCA GCC CTG GAC TAC CTG CAC 1536 
Ala Arg Phe Tyr Gly Ala Glu He Val Ser Ala Leu Asp Tyr Leu His 
500 505 510 

TCG GAG AAG AAC GTG GTG TAC CGG GAC CTC AAG CTG GAG AAC CTC ATG 1584 
Ser Glu Lys Asn Val Val Tyr Arg Asp Leu Lys Leu Glu Asn Leu Met 
515 520 525 
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CTG GAC AAG GAC GGG CAC ATT AAG ATC ACA GAC TTC GGG CTG TGC AAG 1632 

Leu Asp Lys Asp Gly His lie Lys lie Thr Asp Phe Gly Leu Cys Lys 
530 535 540 

GAG GGG ATC AAG GAC GGT GCC ACC ATG AAG ACC TTT TGC GGC ACA CCT 1680 

Glu Gly He Lys Asp Gly Ala Thr Met Lys Thr Phe Cys Gly Thr Pro 

545 550 555 560 

GAG TAC CTG GCC CCC GAG GTG CTG GAG GAC AAT GAC TAC GGC CGT GCA 1728 

Glu Tyr Leu Ala Pro Glu Val Leu Glu Asp Asn Asp Tyr Gly Arg Ala 
565 570 575 

GTG GAC TGG TGG GGG CTG GGC GTG GTC ATG TAC GAG ATG ATG TGC GGT 1776 

Val Asp Trp Trp Gly Leu Gly Val Val Met Tyr Glu Met Met Cys Gly 
580 585 590 

CGC CTG CCC TTC TAC AAC CAG GAC CAT GAG AAG CTT TTT GAG CTC ATC 1824 

Arg Leu Pro Phe Tyr Asn Gin Asp His Glu Lys Leu Phe Glu Leu He 

595 600 605 

CTC ATG GAG GAG ATC CGC TTC CCG CGC ACG CTT GGT CCC GAG GCC AAG 1872 

Leu Met Glu Glu He Arg Phe Pro Arg Thr Leu Gly Pro Glu Ala Lys 
610 615 620 

TCC TTG CTT TCA GGG CTG CTC AAG AAG GAC CCC AAG CAG AGG CTT GGC 1920 

Ser Leu Leu Ser Gly Leu Leu Lys Lys Asp Pro Lys Gin Arg Leu Gly 

625 630 635 640 

GGG GGC TCC GAG GAC GCC AAG GAG ATC ATG CAG CAT CGC TTC TTT GCC 1968 

Gly Gly Ser Glu Asp Ala Lys Glu He Met Gin His Arg Phe Phe Ala 
645 650 655 

GGT ATC GTG TGG CAG CAC GTG TAC GAG AAG AAG CTC AGC CCA CCC TTC 2016 

Gly He Val Trp Gin His Val Tyr Glu Lys Lys Leu Ser Pro Pro Phe 
660 665 670 

AAG CCC CAG GTC ACG TCG GAG ACT GAC ACC AGG TAT TTT GAT GAG GAG 2064 

Lys Pro Gin Val Thr Ser Glu Thr Asp Thr Arg Tyr Phe Asp Glu Glu 

675 680 685 

TTC ACG GCC CAG ATG ATC ACC ATC ACA CCA CCT GAC CAA GAT GAC AGC 2112 

Phe Thr Ala Gin Met He Thr He Thr Pro Pro Asp Gin Asp Asp Ser 
690 695 700 

ATG GAG TGT GTG GAC AGC GAG CGC AGG CCC CAC TTC CCC CAG TTC TCC 2160 

Met Glu Cys Val Asp Ser Glu Arg Arg Pro His Phe Pro Gin Phe Ser 

705 710 715 720 

TAC TCG GCC AGC AGC ACG GCC TGA 2184 
Tyr Ser Ala Ser Ser Thr Ala 
725 



(2) INFORMATION FOR SEQ ID NO: 13 9: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 727 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 HO 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

115 120 125 

He Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 

130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Arg Gly Thr Met Ser Asp Val Ala He Val Lys Glu 

245 250 255 

Gly Trp Leu His Lys Arg Gly Glu Tyr lie Lys Thr Trp Arg Pro Arg 

260 265 270 

Tyr Phe Leu Leu Lys Asn Asp Gly Thr Phe lie Gly Tyr Lys Glu Arg 

275 280 285 

Pro Gin Asp Val Asp Gin Arg Glu Ala Pro Leu Asn Asn Phe Ser Val 

290 295 300 

Ala Gin Cys Gin Leu Met Lys Thr Glu Arg Pro Arg Pro Asn Thr Phe 
305 310 315 320 

He lie Arg Cys Leu Gin Trp Thr Thr Val He Glu Arg Thr Phe His 

325 330 335 

Val Glu Thr Pro Glu Glu Arg Glu Glu Trp Thr Thr Ala lie Gin Thr 

340 345 350 

Val Ala Asp Gly Leu Lys Lys Gin Glu Glu Glu Glu Met Asp Phe Arg 

355 360 365 

Ser Gly Ser Pro Ser Asp Asn Ser Gly Ala Glu Glu Met Glu Val Ser 

370 375 380 

Leu Ala Lys Pro Lys His Arg Val Thr Met Asn Glu Phe Glu Tyr Leu 
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V 



385 



390 



395 



400 



Lys Leu Leu Gly Lys Gly Thr Phe Gly Lys Val He Leu Val Lys Glu 

405 410 415 

Lys Ala Thr Gly Arg Tyr Tyr Ala Met Lys He Leu Lys Lys Glu Val 

420 425 430 

He Val Ala Lys Asp Glu Val Ala His Thr Leu Thr Glu Asn Arg Val 

435 440 445 

Leu Gin Asn Ser Arg His Pro Phe Leu Thr Ala Leu Lys Tyr Ser Phe 

450 455 460 

Gin Thr His Asp Arg Leu Cys Phe Val Met Glu Tyr Ala Asn Gly Gly 
465 470 475 480 

Glu Leu Phe Phe His Leu Ser Arg Glu Arg Val Phe Ser Glu Asp Arg 

485 490 495 

Ala Arg Phe Tyr Gly Ala Glu He Val Ser Ala Leu Asp Tyr Leu His 

500 505 510 

Ser Glu Lys Asn Val Val Tyr Arg Asp Leu Lys Leu Glu Asn Leu Met 

515 520 525 

Leu Asp Lys Asp Gly His lie Lys He Thr Asp Phe Gly Leu Cys Lys 

530 535 540 

Glu Gly He Lys Asp Gly Ala Thr Met Lys Thr Phe Cys Gly Thr Pro 
545 550 555 560 

Glu Tyr Leu Ala Pro Glu Val Leu Glu Asp Asn Asp Tyr Gly Arg Ala 

565 570 575 

Val Asp Trp Trp Gly Leu Gly Val Val Met Tyr Glu Met Met Cys Gly 

580 585 590 

Arg Leu Pro Phe Tyr Asn Gin Asp His Glu Lys Leu Phe Glu Leu He 

595 600 605 

Leu Met Glu Glu He Arg Phe Pro Arg Thr Leu Gly Pro Glu Ala Lys 

610 615 620 

Ser Leu Leu Ser Gly Leu Leu Lys Lys Asp Pro Lys Gin Arg Leu Gly 
625 630 635 640 

Gly Gly Ser Glu Asp Ala Lys Glu lie Met Gin His Arg Phe Phe Ala 

645 650 655 

Gly He Val Trp Gin His Val Tyr Glu Lys Lys Leu Ser Pro Pro Phe 

660 665 670 

Lys Pro Gin Val Thr Ser Glu Thr Asp Thr Arg Tyr Phe Asp Glu Glu 

675 680 685 

Phe Thr Ala Gin Met lie Thr He Thr Pro Pro Asp Gin Asp Asp Ser 

690 695 700 

Met Glu Cys Val Asp Ser Glu Arg Arg Pro His Phe Pro Gin Phe Ser 
705 710 715 720 

Tyr Ser Ala Ser Ser Thr Ala 



(2) INFORMATION FOR SEQ ID NO: 14 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2394 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 
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(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. . .2391 
(D) OTHER INFORMATION: 



22 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 

ATG GAC GAA CTG TTC CCC CTC ATC TTC CCG GCA GAG CCA GCC CAG GCC 48 
Met Asp Glu Leu Phe Pro Leu lie Phe Pro Ala Glu Pro Ala Gin Ala 
15 10 15 

TCT GGC CCC TAT GTG GAG ATC ATT GAG CAG CCC AAG CAG CGG GGC ATG 96 
Ser Gly Pro Tyr Val Glu lie lie Glu Gin Pro Lys Gin Arg Gly Met 
20 25 30 

CGC TTC CGC TAC AAG TGC GAG GGG CGC TCC GCG GGC AGC ATC CCA GGC 144 
Arg Phe Arg Tyr Lys Cys Glu Gly Arg Ser Ala Gly Ser lie Pro Gly 
35 40 45 

GAG AGG AGC AC A GAT ACC ACC AAG ACC CAC CCC ACC ATC AAG ATC AAT 192 
Glu Arg Ser Thr Asp Thr Thr Lys Thr His Pro Thr lie Lys He Asn 
50 55 60 

GGC TAC ACA GGA CCA GGG ACA GTG CGC ATC TCC CTG GTC ACC AAG GAC 240 
Gly Tyr Thr Gly Pro Gly Thr Val Arg He Ser Leu Val Thr Lys Asp 
65 70 75 80 

CCT CCT CAC CGG CCT CAC CCC CAC GAG CTT GTA GGA AAG GAC TGC CGG 288 
Pro Pro His Arg Pro His Pro His Glu Leu Val Gly Lys Asp Cys Arg 
85 90 95 

GAT GGC TTC TAT GAG GCT GAG CTC TGC CCG GAC CGC TGC ATC CAC AGT 3 36 

Asp Gly Phe Tyr Glu Ala Glu Leu Cys Pro Asp Arg Cys He His Ser 
100 105 110 

TTC CAG AAC CTG GGA ATC CAG TGT GTG AAG AAG CGG GAC CTG GAG CAG 3 84 

Phe Gin Asn Leu Gly He Gin Cys Val Lys Lys Arg Asp Leu Glu Gin 
115 120 125 

GCT ATC AGT CAG CGC ATC CAG ACC AAC AAC AAC CCC TTC CAA GTT CCT 432 
Ala He Ser Gin Arg He Gin Thr Asn Asn Asn Pro Phe Gin Val Pro 
130 135 140 

ATA GAA GAG CAG CGT GGG GAC TAC GAC CTG AAT GCT GTG CGG CTC TGC 480 
He Glu Glu Gin Arg Gly Asp Tyr Asp Leu Asn Ala Val Arg Leu Cys 
145 150 155 160 

TTC CAG GTG ACA GTG CGG GAC CCA TCA GGC AGG CCC CTC CGC CTG CCG 52 8 

Phe Gin Val Thr Val Arg Asp Pro Ser Gly Arg Pro Leu Arg Leu Pro 
165 170 175 

CCT GTC CTT CCT CAT CCC ATC TTT GAC AAT CGT GCC CCC AAC ACT GCC 57 6 

Pro Val Leu Pro His Pro lie Phe Asp Asn Arg Ala Pro Asn Thr Ala 
180 185 190 

GAG CTC AAG ATC TGC CGA GTG AAC CGA AAC TCT GGC AGC TGC CTC GGT 624 
Glu Leu Lys He Cys Arg Val Asn Arg Asn Ser Gly Ser Cys Leu Gly 
195 200 205 

GGG GAT GAG ATC TTC CTA CTG TGT GAC AAG GTG CAG AAA GAG GAC ATT 672 
Gly Asp Glu He Phe Leu Leu Cys Asp Lys Val Gin Lys Glu Asp He 
210 215 220 
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GAG GTG TAT TTC ACG GGA CCA GGC TGG GAG GCC CGA GGC TCC TTT TCG 720 
Glu Val Tyr Phe Thr Gly Pro Gly Trp Glu Ala Arg Gly Ser Phe Ser 
225 230 235 240 

CAA GCT GAT GTG CAC CGA CAA GTG GCC ATT GTG TTC CGG ACC CCT CCC 768 
Gin Ala Asp Val His Arg Gin Val Ala He Val Phe Arg Thr Pro Pro 
245 250 255 

TAC GCA GAC CCC AGC CTG CAG GCT CCT GTG CGT GTC TCC ATG CAG CTG 816 
Tyr Ala Asp Pro Ser Leu Gin Ala Pro Val Arg Val Ser Met Gin Leu 
260 265 270 

CGG CGG CCT TCC GAC CGG GAG CTC AGT GAG CCC ATG GAA TTC CAG TAC 864 
Arg Arg Pro Ser Asp Arg Glu Leu Ser Glu Pro Met Glu Phe Gin Tyr 
275 280 285 

CTG CCA GAT ACA GAC GAT CGT CAC CGG ATT GAG GAG AAA CGT AAA AGG 912 
Leu Pro Asp Thr Asp Asp Arg His Arg He Glu Glu Lys Arg Lys Arg 
290 295 300 

ACA TAT GAG ACC TTC AAG AGC ATC ATG AAG AAG AGT CCT TTC AGC GGA 960 
Thr Tyr Glu Thr Phe Lys Ser He Met Lys Lys Ser Pro Phe Ser Gly 
305 310 315 320 

CCC ACC GAC CCC CGG CCT CCA CCT CGA CGC ATT GCT GTG CCT TCC CGC 1008 
Pro Thr Asp Pro Arg Pro Pro Pro Arg Arg He Ala Val Pro Ser Arg 
325 330 335 

AGC TCA GCT TCT GTC CCC AAG CCA GCA CCC CAG CCC TAT CCC TTT ACG 1056 
Ser Ser Ala Ser Val Pro Lys Pro Ala Pro Gin Pro Tyr Pro Phe Thr 
340 345 350 

TCA TCC CTG AGC ACC ATC AAC TAT GAT GAG TTT CCC ACC ATG GTG TTT 1104 
Ser Ser Leu Ser Thr He Asn Tyr Asp Glu Phe Pro Thr Met Val Phe 
355 360 365 

CCT TCT GGG CAG ATC AGC CAG GCC TCG GCC TTG GCC CCG GCC CCT CCC 1152 
Pro Ser Gly Gin He Ser Gin Ala Ser Ala Leu Ala Pro Ala Pro Pro 
370 375 380 

CAA GTC CTG CCC CAG GCT CCA GCC CCT GCC CCT GCT CCA GCC ATG GTA 1200 
Gin Val Leu Pro Gin Ala Pro Ala Pro Ala Pro Ala Pro Ala Met Val 
385 390 395 400 

TCA GCT CTG GCC CAG GCC CCA GCC CCT GTC CCA GTC CTA GCC CCA GGC 1248 
Ser Ala Leu Ala Gin Ala Pro Ala Pro Val Pro Val Leu Ala Pro Gly 
405 410 415 

CCT CCT CAG GCT GTG GCC CCA CCT GCC CCC AAG CCC ACC CAG GCT GGG 1296 
Pro Pro Gin Ala Val Ala Pro Pro Ala Pro Lys Pro Thr Gin Ala Gly 
420 425 430 

GAA GGA ACG CTG TCA GAG GCC CTG CTG CAG CTG CAG TTT GAT GAT GAA 1344 
Glu Gly Thr Leu Ser Glu Ala Leu Leu Gin Leu Gin Phe Asp Asp Glu 
435 440 445 

GAC CTG GGG GCC TTG CTT GGC AAC AGC ACA GAC CCA GCT GTG TTC ACA 1392 
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Asp Leu Gly Ala Leu Leu Gly Asn Ser Thr Asp Pro Ala Val Phe Thr 
450 455 460 

GAC CTG GCA TCC GTC GAC AAC TCC GAG TTT CAG CAG CTG CTG AAC CAG 1440 

Asp Leu Ala Ser Val Asp Asn Ser Glu Phe Gin Gin Leu Leu Asn Gin 
465 470 475 480 

GGC ATA CCT GTG GCC CCC CAC ACA ACT GAG CCC ATG CTG ATG GAG TAC 1488 

Gly lie Pro Val Ala Pro His Thr Thr Glu Pro Met Leu Met Glu Tyr 
485 490 495 

CCT GAG GCT ATA ACT CGC CTA GTG ACA GGG GCC CAG AGG CCC CCC GAC 15 36 

Pro Glu Ala lie Thr Arg Leu Val Thr Gly Ala Gin Arg Pro Pro Asp 
500 505 510 

CCA GCT CCT GCT CCA CTG GGG GCC CCG GGG CTC CCC AAT GGC CTC CTT 1584 

Pro Ala Pro Ala Pro Leu Gly Ala Pro Gly Leu Pro Asn Gly Leu Leu 
515 520 525 

TCA GGA GAT GAA GAC TTC TCC TCC ATT GCG GAC ATG GAC TTC TCA GCC 1632 

Ser Gly Asp Glu Asp Phe Ser Ser lie Ala Asp Met Asp Phe Ser Ala 
530 535 540 

CTG CTG AGT CAG ATC AGC TCC TTG GAT CCA CCG GTC GCC ACC ATG GTG 1680 

Leu Leu Ser Gin lie Ser Ser Leu Asp Pro Pro Val Ala Thr Met Val 
545 550 555 560 

AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG GTC GAG 1728 

Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val Glu 
565 570 575 

CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC GAG GGC 1776 

Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly 
580 585 590 

GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC TGC ACC 1824 

Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys Thr 
595 600 605 

ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC CTG ACC 1872 

Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr 
610 615 620 

TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG CAG CAC 1920 

Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His 
625 630 635 640 

GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG CGC ACC 1968 

Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr 
645 650 655 

ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG GTG AAG 2016 

lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys 
660 665 670 

TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC ATC GAC 2 064 

Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie Asp 
675 680 685 



TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC AAC TAC 2112 
Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr Asn Tyr 
690 695 700 

AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC GGC ATC 2160 
Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly He 
705 710 715 720 

AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC GTG CAG 2208 
Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin 
725 730 735 

CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC CCC GTG 2256 
Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly Pro Val 
740 745 750 

CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG AGC AAA 2304 
Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys 
755 760 765 

GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC GTG ACC 23 52 
Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr 
770 775 780 

GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TAA 2394 
Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
785 790 795 



(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 797 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

Met Asp Glu Leu Phe Pro Leu He Phe Pro Ala Glu Pro Ala Gin Ala 

15 10 15 

Ser Gly Pro Tyr Val Glu He lie Glu Gin Pro Lys Gin Arg Gly Met 

20 25 30 

Arg Phe Arg Tyr Lys Cys Glu Gly Arg Ser Ala Gly Ser lie Pro Gly 

35 40 45 

Glu Arg Ser Thr Asp Thr Thr Lys Thr His Pro Thr lie Lys He Asn 

50 55 60 

Gly Tyr Thr Gly Pro Gly Thr Val Arg He Ser Leu Val Thr Lys Asp 
65 70 75 80 

Pro Pro His Arg Pro His Pro His Glu Leu Val Gly Lys Asp Cys Arg 

85 90 95 

Asp Gly Phe Tyr Glu Ala Glu Leu Cys Pro Asp Arg Cys lie His Ser 

100 105 HO 

Phe Gin Asn Leu Gly He Gin Cys Val Lys Lys Arg Asp Leu Glu Gin 
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115 120 125 

Ala He Ser Gin Arg He Gin Thr Asn Asn Asn Pro Phe Gin Val Pro 

130 135 140 

He Glu Glu Gin Arg Gly Asp Tyr Asp Leu Asn Ala Val Arg Leu Cys 
145 150 155 160 

Phe Gin Val Thr Val Arg Asp Pro Ser Gly Arg Pro Leu Arg Leu Pro 

165 170 175 

Pro Val Leu Pro His Pro He Phe Asp Asn Arg Ala Pro Asn Thr Ala 

180 185 190 

Glu Leu Lys He Cys Arg Val Asn Arg Asn Ser Gly Ser Cys Leu Gly 

195 200 205 

Gly Asp Glu He Phe Leu Leu Cys Asp Lys Val Gin Lys Glu Asp He 

210 215 220 

Glu Val Tyr Phe Thr Gly Pro Gly Trp Glu Ala Arg Gly Ser Phe Ser 
225 230 235 240 

Gin Ala Asp Val His Arg Gin Val Ala He Val Phe Arg Thr Pro Pro 

245 250 255 

Tyr Ala Asp Pro Ser Leu Gin Ala Pro Val Arg Val Ser Met Gin Leu 

260 265 270 

Arg Arg Pro Ser Asp Arg Glu Leu Ser Glu Pro Met Glu Phe Gin Tyr 

275 280 285 

Leu Pro Asp Thr Asp Asp Arg His Arg He Glu Glu Lys Arg Lys Arg 

290 295 300 

Thr Tyr Glu Thr Phe Lys Ser He Met Lys Lys Ser Pro Phe Ser Gly 
305 310 315 320 

Pro Thr Asp Pro Arg Pro Pro Pro Arg Arg He Ala Val Pro Ser Arg 

325 330 335 

Ser Ser Ala Ser Val Pro Lys Pro Ala Pro Gin Pro Tyr Pro Phe Thr 

340 345 350 

Ser Ser Leu Ser Thr He Asn Tyr Asp Glu Phe Pro Thr Met Val Phe 

355 360 365 

Pro Ser Gly Gin He Ser Gin Ala Ser Ala Leu Ala Pro Ala Pro Pro 

370 375 380 

Gin Val Leu Pro Gin Ala Pro Ala Pro Ala Pro Ala Pro Ala Met Val 
385 390 395 400 

Ser Ala Leu Ala Gin Ala Pro Ala Pro Val Pro Val Leu Ala Pro Gly 

405 410 415 

Pro Pro Gin Ala Val Ala Pro Pro Ala Pro Lys Pro Thr Gin Ala Gly 

420 425 430 

Glu Gly Thr Leu Ser Glu Ala Leu Leu Gin Leu Gin Phe Asp Asp Glu 

435 440 445 

Asp Leu Gly Ala Leu Leu Gly Asn Ser Thr Asp Pro Ala Val Phe Thr 

450 455 460 

Asp Leu Ala Ser Val Asp Asn Ser Glu Phe Gin Gin Leu Leu Asn Gin 
465 470 475 480 

Gly He Pro Val Ala Pro His Thr Thr Glu Pro Met Leu Met Glu Tyr 

485 490 495 

Pro Glu Ala He Thr Arg Leu Val Thr Gly Ala Gin Arg Pro Pro Asp 

500 505 510 

Pro Ala Pro Ala Pro Leu Gly Ala Pro Gly Leu Pro Asn Gly Leu Leu 

515 520 525 

Ser Gly Asp Glu Asp Phe Ser Ser He Ala Asp Met Asp Phe Ser Ala 

530 535 540 

Leu Leu Ser Gin He Ser Ser Leu Asp Pro Pro Val Ala Thr Met Val 
545 550 555 560 

Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu 

565 570 575 

Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly 
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Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys Thr 

595 600 605 

Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr 

610 615 620 

Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His 
625 630 635 640 

Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr 

645 650 655 

lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys 

660 665 670 

Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie Asp 

675 680 685 

Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr Asn Tyr 

690 695 700 

Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn Gly lie 
705 710 715 720 

Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser Val Gin 

725 730 735 

Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly Pro Val 

740 745 750 

Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys 

755 760 765 

Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr 

770 775 780 

Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
785 790 795 

(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2394 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...2391 
<D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 48 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 



20 



25 



30 



GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 



144 



TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 



192 
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Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 240 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 2 88 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 336 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 3 84 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 432 
lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 480 
Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 528 
Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 
165 170 175 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 57 6 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 
180 185 190 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TCC 720 
Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

GGA CTC AGA TCT CGA GCC ATG GAC GAA CTG TTC CCC CTC ATC TTC CCG 7 68 

Gly Leu Arg Ser Arg Ala Met Asp Glu Leu Phe Pro Leu lie Phe Pro 
245 250 255 

GCA GAG CCA GCC CAG GCC TCT GGC CCC TAT GTG GAG ATC ATT GAG CAG 816 
Ala Glu Pro Ala Gin Ala Ser Gly Pro Tyr Val Glu He He Glu Gin 
260 265 270 

CCC AAG CAG CGG GGC ATG CGC TTC CGC TAC AAG TGC GAG GGG CGC TCC 864 
Pro Lys Gin Arg Gly Met Arg Phe Arg Tyr Lys Cys Glu Gly Arg Ser 
275 280 285 
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GCG GGC AGC ATC CCA GGC GAG AGG AGC ACA GAT ACC ACC AAG ACC CAC 912 
Ala Gly Ser lie Pro Gly Glu Arg Ser Thr Asp Thr Thr Lys Thr His 
290 295 300 

CCC ACC ATC AAG ATC AAT GGC TAC ACA GGA CCA GGG ACA GTG CGC ATC 960 
Pro Thr lie Lys lie Asn Gly Tyr Thr Gly Pro Gly Thr Val Arg lie 
305 310 315 320 

TCC CTG GTC ACC AAG GAC CCT CCT CAC CGG CCT CAC CCC CAC GAG CTT 1008 
Ser Leu Val Thr Lys Asp Pro Pro His Arg Pro His Pro His Glu Leu 
325 330 335 

GTA GGA AAG GAC TGC CGG GAT GGC TTC TAT GAG GCT GAG CTC TGC CCG 1056 
Val Gly Lys Asp Cys Arg Asp Gly Phe Tyr Glu Ala Glu Leu Cys Pro 
340 345 350 

GAC CGC TGC ATC CAC AGT TTC CAG AAC CTG GGA ATC CAG TGT GTG AAG 1104 
Asp Arg Cys He His Ser Phe Gin Asn Leu Gly He Gin Cys Val Lys 
355 360 365 

AAG CGG GAC CTG GAG CAG GCT ATC AGT CAG CGC ATC CAG ACC AAC AAC 1152 
Lys Arg Asp Leu Glu Gin Ala He Ser Gin Arg He Gin Thr Asn Asn 
370 375 380 

AAC CCC TTC CAA GTT CCT ATA GAA GAG CAG CGT GGG GAC TAC GAC CTG 1200 
Asn Pro Phe Gin Val Pro He Glu Glu Gin Arg Gly Asp Tyr Asp Leu 
385 390 395 400 

AAT GCT GTG CGG CTC TGC TTC CAG GTG ACA GTG CGG GAC CCA TCA GGC 1248 
Asn Ala Val Arg Leu Cys Phe Gin Val Thr Val Arg Asp Pro Ser Gly 
405 410 415 

AGG CCC CTC CGC CTG CCG CCT GTC CTT CCT CAT CCC ATC TTT GAC AAT 1296 
Arg Pro Leu Arg Leu Pro Pro Val Leu Pro His Pro He Phe Asp Asn 
420 425 430 

CGT GCC CCC AAC ACT GCC GAG CTC AAG ATC TGC CGA GTG AAC CGA AAC 1344 
Arg Ala Pro Asn Thr Ala Glu Leu Lys He Cys Arg Val Asn Arg Asn 
435 440 445 

TCT GGC AGC TGC CTC GGT GGG GAT GAG ATC TTC CTA CTG TGT GAC AAG 1392 
Ser Gly Ser Cys Leu Gly Gly Asp Glu He Phe Leu Leu Cys Asp Lys 
450 455 460 

GTG CAG AAA GAG GAC ATT GAG GTG TAT TTC ACG GGA CCA GGC TGG GAG 1440 
Val Gin Lys Glu Asp He Glu Val Tyr Phe Thr Gly Pro Gly Trp Glu 
465 470 475 480 

GCC CGA GGC TCC TTT TCG CAA GCT GAT GTG CAC CGA CAA GTG GCC ATT 1488 
Ala Arg Gly Ser Phe Ser Gin Ala Asp Val His Arg Gin Val Ala He 
485 490 495 

GTG TTC CGG ACC CCT CCC TAC GCA GAC CCC AGC CTG CAG GCT CCT GTG 1536 
Val Phe Arg Thr Pro Pro Tyr Ala Asp Pro Ser Leu Gin Ala Pro Val 
500 505 510 

CGT GTC TCC ATG CAG CTG CGG CGG CCT TCC GAC CGG GAG CTC AGT GAG 1584 



Arg Val Ser Met Gin Leu Arg Arg Pro Ser Asp Arg Glu Leu Ser Glu 
515 520 525 

CCC ATG GAA TTC CAG TAC CTG CCA GAT ACA GAC GAT CGT CAC CGG ATT 1632 
Pro Met Glu Phe Gin Tyr Leu Pro Asp Thr Asp Asp Arg His Arg lie 
530 535 540 

GAG GAG AAA CGT AAA AGG ACA TAT GAG ACC TTC AAG AGC ATC ATG AAG 1680 
Glu Glu Lys Arg Lys Arg Thr Tyr Glu Thr Phe Lys Ser lie Met Lys 
545 550 555 560 

AAG AGT CCT TTC AGC GGA CCC ACC GAC CCC CGG CCT CCA CCT CGA CGC 172 8 
Lys Ser Pro Phe Ser Gly Pro Thr Asp Pro Arg Pro Pro Pro Arg Arg 
565 570 575 

ATT GCT GTG CCT TCC CGC AGC TCA GCT TCT GTC CCC AAG CCA GCA CCC 1776 
lie Ala Val Pro Ser Arg Ser Ser Ala Ser Val Pro Lys Pro Ala Pro 
580 585 590 

CAG CCC TAT CCC TTT ACG TCA TCC CTG AGC ACC ATC AAC TAT GAT GAG 182 4 
Gin Pro Tyr Pro Phe Thr Ser Ser Leu Ser Thr lie Asn Tyr Asp Glu 
595 600 605 

TTT CCC ACC ATG GTG TTT CCT TCT GGG CAG ATC AGC CAG GCC TCG GCC 1872 
Phe Pro Thr Met Val Phe Pro Ser Gly Gin lie Ser Gin Ala Ser Ala 
610 615 620 

TTG GCC CCG GCC CCT CCC CAA GTC CTG CCC CAG GCT CCA GCC CCT GCC 1920 
Leu Ala Pro Ala Pro Pro Gin Val Leu Pro Gin Ala Pro Ala Pro Ala 
625 630 635 640 

CCT GCT CCA GCC ATG GTA TCA GCT CTG GCC CAG GCC CCA GCC CCT GTC 1968 
Pro Ala Pro Ala Met Val Ser Ala Leu Ala Gin Ala Pro Ala Pro Val 
645 650 655 

CCA GTC CTA GCC CCA GGC CCT CCT CAG GCT GTG GCC CCA CCT GCC CCC 2016 
Pro Val Leu Ala Pro Gly Pro Pro Gin Ala Val Ala Pro Pro Ala Pro 
660 665 670 

AAG CCC ACC CAG GCT GGG GAA GGA ACG CTG TCA GAG GCC CTG CTG CAG 2064 
Lys Pro Thr Gin Ala Gly Glu Gly Thr Leu Ser Glu Ala Leu Leu Gin 
675 680 685 

CTG CAG TTT GAT GAT GAA GAC CTG GGG GCC TTG CTT GGC AAC AGC ACA 2112 
Leu Gin Phe Asp Asp Glu Asp Leu Gly Ala Leu Leu Gly Asn Ser Thr 
690 695 700 

GAC CCA GCT GTG TTC ACA GAC CTG GCA TCC GTC GAC AAC TCC GAG TTT 2160 
Asp Pro Ala Val Phe Thr Asp Leu Ala Ser Val Asp Asn Ser Glu Phe 
705 710 715 720 

CAG CAG CTG CTG AAC CAG GGC ATA CCT GTG GCC CCC CAC ACA ACT GAG 2208 
Gin Gin Leu Leu Asn Gin Gly lie Pro Val Ala Pro His Thr Thr Glu 
725 730 735 

CCC ATG CTG ATG GAG TAC CCT GAG GCT ATA ACT CGC CTA GTG ACA GGG 22 56 
Pro Met Leu Met Glu Tyr Pro Glu Ala lie Thr Arg Leu Val Thr Gly 
740 745 750 
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GCC CAG AGG CCC CCC GAC CCA GCT CCT GCT CCA CTG GGG GCC CCG GGG 23 04 

Ala Gin Arg Pro Pro Asp Pro Ala Pro Ala Pro Leu Gly Ala Pro Gly 
755 760 765 

CTC CCC AAT GGC CTC CTT TCA GGA GAT GAA GAC TTC TCC TCC ATT GCG 2 3 52 

Leu Pro Asn Gly Leu Leu Ser Gly Asp Glu Asp Phe Ser Ser lie Ala 
770 775 780 

GAC ATG GAC TTC TCA GCC CTG CTG AGT CAG ATC AGC TCC TAA 23 94 

Asp Met Asp Phe Ser Ala Leu Leu Ser Gin lie Ser Ser 
785 790 795 



(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 797 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 

15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 

115 120 125 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 

130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys lie Arg His Asn He Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Arg Ala Met Asp Glu Leu Phe Pro Leu He Phe Pro 



245 250 255 

Ala Glu Pro Ala Gin Ala Ser Gly Pro Tyr Val Glu lie He Glu Gin 

260 265 270 

Pro Lys Gin Arg Gly Met Arg Phe Arg Tyr Lys Cys Glu Gly Arg Ser 

275 280 285 

Ala Gly Ser He Pro Gly Glu Arg Ser Thr Asp Thr Thr Lys Thr His 

290 295 300 

Pro Thr He Lys He Asn Gly Tyr Thr Gly Pro Gly Thr Val Arg He 
305 310 315 320 

Ser Leu Val Thr Lys Asp Pro Pro His Arg Pro His Pro His Glu Leu 

325 330 335 

Val Gly Lys Asp Cys Arg Asp Gly Phe Tyr Glu Ala Glu Leu Cys Pro 

340 345 350 

Asp Arg Cys He His Ser Phe Gin Asn Leu Gly He Gin Cys Val Lys 

355 360 365 

Lys Arg Asp Leu Glu Gin Ala lie Ser Gin Arg lie Gin Thr Asn Asn 

370 375 380 

Asn Pro Phe Gin Val Pro lie Glu Glu Gin Arg Gly Asp Tyr Asp Leu 
385 390 395 400 

Asn Ala Val Arg Leu Cys Phe Gin Val Thr Val Arg Asp Pro Ser Gly 

405 410 415 

Arg Pro Leu Arg Leu Pro Pro Val Leu Pro His Pro lie Phe Asp Asn 

420 425 430 

Arg Ala Pro Asn Thr Ala Glu Leu Lys lie Cys Arg Val Asn Arg Asn 

435 440 445 

Ser Gly Ser Cys Leu Gly Gly Asp Glu lie Phe Leu Leu Cys Asp Lys 

450 455 460 

Val Gin Lys Glu Asp He Glu Val Tyr Phe Thr Gly Pro Gly Trp Glu 
465 470 475 480 

Ala Arg Gly Ser Phe Ser Gin Ala Asp Val His Arg Gin Val Ala lie 

485 490 495 

Val Phe Arg Thr Pro Pro Tyr Ala Asp Pro Ser Leu Gin Ala Pro Val 

500 505 510 

Arg Val Ser Met Gin Leu Arg Arg Pro Ser Asp Arg Glu Leu Ser Glu 

515 520 525 

Pro Met Glu Phe Gin Tyr Leu Pro Asp Thr Asp Asp Arg His Arg lie 

530 535 540 

Glu Glu Lys Arg Lys Arg Thr Tyr Glu Thr Phe Lys Ser He Met Lys 
545 550 555 560 

Lys Ser Pro Phe Ser Gly Pro Thr Asp Pro Arg Pro Pro Pro Arg Arg 

565 570 575 

lie Ala Val Pro Ser Arg Ser Ser Ala Ser Val Pro Lys Pro Ala Pro 

580 585 590 

Gin Pro Tyr Pro Phe Thr Ser Ser Leu Ser Thr lie Asn Tyr Asp Glu 

595 600 605 

Phe Pro Thr Met Val Phe Pro Ser Gly Gin lie Ser Gin Ala Ser Ala 

610 615 620 

Leu Ala Pro Ala Pro Pro Gin Val Leu Pro Gin Ala Pro Ala Pro Ala 
625 630 635 640 

Pro Ala Pro Ala Met Val Ser Ala Leu Ala Gin Ala Pro Ala Pro Val 

645 650 655 

Pro Val Leu Ala Pro Gly Pro Pro Gin Ala Val Ala Pro Pro Ala Pro 

660 665 670 

Lys Pro Thr Gin Ala Gly Glu Gly Thr Leu Ser Glu Ala Leu Leu Gin 

675 680 685 

Leu Gin Phe Asp Asp Glu Asp Leu Gly Ala Leu Leu Gly Asn Ser Thr 

690 695 700 

Asp Pro Ala Val Phe Thr Asp Leu Ala Ser Val Asp Asn Ser Glu Phe 



705 710 715 720 

Gin Gin Leu Leu Asn Gin Gly He Pro Val Ala Pro His Thr Thr Glu 

725 730 735 

Pro Met Leu Met Glu Tyr Pro Glu Ala He Thr Arg Leu Val Thr Gly 

740 745 750 

Ala Gin Arg Pro Pro Asp Pro Ala Pro Ala Pro Leu Gly Ala Pro Gly 

755 760 765 

Leu Pro Asn Gly Leu Leu Ser Gly Asp Glu Asp Phe Ser Ser He Ala 

770 775 780 

Asp Met Asp Phe Ser Ala Leu Leu Ser Gin He Ser Ser 
785 790 795 



(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...3378 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 

ATG GAG CGG GCC GGC CCC AGC TTC GGG CAG CAG CGA CAG CAG CAG CAG 48 
Met Glu Arg Ala Gly Pro Ser Phe Gly Gin Gin Arg Gin Gin Gin Gin 
15 10 15 

CCC CAG CAG CAG AAG CAG CAG CAG AGG GAT CAG GAC TCG GTC GAA GCA 96 
Pro Gin Gin Gin Lys Gin Gin Gin Arg Asp Gin Asp Ser Val Glu Ala 
20 25 30 

TGG CTG GAC GAT CAC TGG GAC TTT ACC TTC TCA TAC TTT GTT AGA AAA 144 
Trp Leu Asp Asp His Trp Asp Phe Thr Phe Ser Tyr Phe Val Arg Lys 
35 40 45 

GCC ACC AGA GAA ATG GTC AAT GCA TGG TTT GCT GAG AGA GTT CAC ACC 192 
Ala Thr Arg Glu Met Val Asn Ala Trp Phe Ala Glu Arg Val His Thr 
50 55 60 

ATC CCT GTG TGC AAG GAA GGT ATC AGA GGC CAC ACC GAA TCT TGC TCT 240 
He Pro Val Cys Lys Glu Gly lie Arg Gly His Thr Glu Ser Cys Ser 
65 70 75 80 

TGT CCC TTG CAG CAG AGT CCT CGT GCA GAT AAC AGT GTC CCT GGA ACA 288 
Cys Pro Leu Gin Gin Ser Pro Arg Ala Asp Asn Ser Val Pro Gly Thr 
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85 90 95 

CCA ACC AGG AAA ATC TCT GCC TCT GAA TTT GAC CGG CCT CTT AGA CCC 3 36 

Pro Thr Arg Lys lie Ser Ala Ser Glu Phe Asp Arg Pro Leu Arg Pro 
100 105 110 

ATT GTT GTC AAG GAT TCT GAG GGA ACT GTG AGC TTC CTC TCT GAC TCA 3 84 

He Val Val Lys Asp Ser Glu Gly Thr Val Ser Phe Leu Ser Asp Ser 
115 120 125 

GAA AAG AAG GAA CAG ATG CCT CTA ACC CCT CCA AGG TTT GAT CAT GAT 432 
Glu Lys Lys Glu Gin Met Pro Leu Thr Pro Pro Arg Phe Asp His Asp 
130 135 140 

GAA GGG GAC CAG TGC TCA AGA CTC TTG GAA TTA GTG AAG GAT ATT TCT 480 
Glu Gly Asp Gin Cys Ser Arg Leu Leu Glu Leu Val Lys Asp He Ser 
145 150 155 160 

AGT CAT TTG GAT GTC ACA GCC TTA TGT CAC AAA ATT TTC TTG CAT ATC 528 
Ser His Leu Asp Val Thr Ala Leu Cys His Lys He Phe Leu His He 
165 170 175 

CAT GGA CTG ATA TCT GCT GAC CGC TAT TCC CTG TTC CTT GTC TGT GAA 576 
His Gly Leu He Ser Ala Asp Arg Tyr Ser Leu Phe Leu Val Cys Glu 
180 185 190 

GAC AGC TCC AAT GAC AAG TTT CTT ATC AGC CGC CTC TTT GAT GTT GCT 624 
Asp Ser Ser Asn Asp Lys Phe Leu lie Ser Arg Leu Phe Asp Val Ala 
195 200 205 

GAA GGT TCA ACA CTG GAA GAA GTT TCA AAT AAC TGT ATC CGC TTA GAA 672 
Glu Gly Ser Thr Leu Glu Glu Val Ser Asn Asn Cys He Arg Leu Glu 
210 215 220 

TGG AAC AAA GGC ATT GTG GGA CAT GTG GCA GCG CTT GGT GAG CCC TTG 720 
Trp Asn Lys Gly He Val Gly His Val Ala Ala Leu Gly Glu Pro Leu 
225 230 235 240 

AAC ATC AAA GAT GCA TAT GAG GAT CCT CGG TTC AAT GCA GAA GTT GAC 768 
Asn He Lys Asp Ala Tyr Glu Asp Pro Arg Phe Asn Ala Glu Val Asp 
245 250 255 

CAA ATT ACA GGC TAC AAG ACA CAA AGC ATT CTT TGT ATG CCA ATT AAG 816 
Gin He Thr Gly Tyr Lys Thr Gin Ser He Leu Cys Met Pro He Lys 
260 265 270 

AAT CAT AGG GAA GAG GTT GTT GGT GTA GCC CAG GCC ATC AAC AAG AAA 864 
Asn His Arg Glu Glu Val Val Gly Val Ala Gin Ala He Asn Lys Lys 
275 280 285 

TCA GGA AAC GGT GGG ACA TTT ACT GAA AAA GAT GAA AAG GAC TTT GCT 912 
Ser Gly Asn Gly Gly Thr Phe Thr Glu Lys Asp Glu Lys Asp Phe Ala 
290 295 300 



2.3* 



GCT TAT TTG GCA TTT TGT GGT ATT GTT CTT CAT AAT GCT CAG CTC TAT 960 
Ala Tyr Leu Ala Phe Cys Gly lie Val Leu His Asn Ala Gin Leu Tyr 
305 310 315 320 

GAG ACT TCA CTG CTG GAG AAC AAG AGA AAT CAG GTG CTG CTT GAC CTT 1008 
Glu Thr Ser Leu Leu Glu Asn Lys Arg Asn Gin Val Leu Leu Asp Leu 
325 330 335 

GCT AGT TTA ATT TTT GAA GAA CAA CAA TCA TTA GAA GTA ATT TTG AAG 1056 
Ala Ser Leu lie Phe Glu Glu Gin Gin Ser Leu Glu Val lie Leu Lys 
340 345 350 

AAA ATA GCT GCC ACT ATT ATC TCT TTC ATG CAA GTG CAG AAA TGC ACC 1104 
Lys lie Ala Ala Thr He He Ser Phe Met Gin Val Gin Lys Cys Thr 
355 360 365 

ATT TTC ATA GTG GAT GAA GAT TGC TCC GAT TCT TTT TCT AGT GTG TTT 1152 
He Phe He Val Asp Glu Asp Cys Ser Asp Ser Phe Ser Ser Val Phe 
370 375 380 

CAC ATG GAG TGT GAG GAA TTA GAA AAA TCA TCT GAT ACA TTA ACA AGG 1200 
His Met Glu Cys Glu Glu Leu Glu Lys Ser Ser Asp Thr Leu Thr Arg 
385 390 395 400 

GAA CAT GAT GCA AAC AAA ATC AAT TAC ATG TAT GCT CAG TAT GTC AAA 1248 
Glu His Asp Ala Asn Lys lie Asn Tyr Met Tyr Ala Gin Tyr Val Lys 
405 410 415 

AAT ACT ATG GAA CCA CTT AAT ATC CCA GAT GTC AGT AAG GAT AAA AGA 1296 
Asn Thr Met Glu Pro Leu Asn He Pro Asp Val Ser Lys Asp Lys Arg 
420 425 430 

TTT CCC TGG ACA ACT GAA AAT ACA GGA AAT GTA AAC CAG CAG TGC ATT 1344 
Phe Pro Trp Thr Thr Glu Asn Thr Gly Asn Val Asn Gin Gin Cys He 
435 440 445 

AGA AGT TTG CTT TGT ACA CCT ATA AAA AAT GGA AAG AAG AAT AAA GTT 1392 
Arg Ser Leu Leu Cys Thr Pro lie Lys Asn Gly Lys Lys Asn Lys Val 
450 455 460 

ATA GGG GTT TGC CAA CTT GTT AAT AAG ATG GAG GAG AAT ACT GGC AAG 1440 
He Gly Val Cys Gin Leu Val Asn Lys Met Glu Glu Asn Thr Gly Lys 
465 470 475 480 

GTT AAG CCT TTC AAC CGA AAT GAC GAA CAG TTT CTG GAA GCT TTT GTC 1488 
Val Lys Pro Phe Asn Arg Asn Asp Glu Gin Phe Leu Glu Ala Phe Val 
485 490 495 

ATC TTT TGT GGC TTG GGG ATC CAG AAC ACG CAG ATG TAT GAA GCA GTG 1536 
He Phe Cys Gly Leu Gly He Gin Asn Thr Gin Met Tyr Glu Ala Val 
500 505 510 

GAG AGA GCC ATG GCC AAG CAA ATG GTC ACA TTG GAG GTT CTG TCG TAT 1584 
Glu Arg Ala Met Ala Lys Gin Met Val Thr Leu Glu Val Leu Ser Tyr 
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515 520 525 

CAT GCT TCA GCA GCA GAG GAA GAA ACA AGA GAG CTA CAG TCG TTA GCG 1632 
His Ala Ser Ala Ala Glu Glu Glu Thr Arg Glu Leu Gin Ser Leu Ala 
530 535 540 

GCT GCT GTG GTG CCA TCT GCC CAG ACC CTT AAA ATT ACT GAC TTT AGC 1680 
Ala Ala Val Val Pro Ser Ala Gin Thr Leu Lys He Thr Asp Phe Ser 
545 550 555 560 

TTC AGT GAC TTT GAG CTG TCT GAT CTG GAA ACA GCA CTG TGC ACA ATT 1728 
Phe Ser Asp Phe Glu Leu Ser Asp Leu Glu Thr Ala Leu Cys Thr He 
565 570 575 

CGG ATG TTT ACT GAC CTC AAC CTT GTG CAG AAC TTC CAG ATG AAA CAT 1776 
Arg Met Phe Thr Asp Leu Asn Leu Val Gin Asn Phe Gin Met Lys His 
580 585 590 

GAG GTT CTT TGC AGA TGG ATT TTA AGT GTT AAG AAG AAT TAT CGG AAG 1824 
Glu Val Leu Cys Arg Trp He Leu Ser Val Lys Lys Asn Tyr Arg Lys 
595 600 605 

AAT GTT GCC TAT CAT AAT TGG AGA CAT GCC TTT AAT ACA GCT CAG TGC 1872 
Asn Val Ala Tyr His Asn Trp Arg His Ala Phe Asn Thr Ala Gin Cys 
610 615 620 

ATG TTT GCT GCT CTA AAA GCA GGC AAA ATT CAG AAC AAG CTG ACT GAC 1920 
Met Phe Ala Ala Leu Lys Ala Gly Lys He Gin Asn Lys Leu Thr Asp 
625 630 635 640 

CTG GAG ATA CTT GCA TTG CTG ATT GCT GCA CTA AGC CAC GAT TTG GAT 1968 
Leu Glu He Leu Ala Leu Leu He Ala Ala Leu Ser His Asp Leu Asp 
645 650 655 

CAC CGT GGT GTG AAT AAC TCT TAC ATA CAG CGA AGT GAA CAT CCA CTT 2 016 
His Arg Gly Val Asn Asn Ser Tyr He Gin Arg Ser Glu His Pro Leu 
660 665 670 

GCC CAG CTT TAC TGC CAT TCA ATC ATG GAA CAC CAT CAT TTT GAC CAG 2064 
Ala Gin Leu Tyr Cys His Ser He Met Glu His His His Phe Asp Gin 
675 680 685 

TGC CTG ATG ATT CTT AAT AGT CCA GGC AAT CAG ATT CTC AGT GGC CTC 2112 
Cys Leu Met He Leu Asn Ser Pro Gly Asn Gin He Leu Ser Gly Leu 
690 695 700 

TCC ATT GAA GAA TAT AAG ACC ACG TTG AAA ATA ATC AAG CAA GCT ATT 2160 
Ser He Glu Glu Tyr Lys Thr Thr Leu Lys He He Lys Gin Ala He 
705 710 715 720 

TTA GCT ACA GAC CTA GCA CTG TAC ATT AAG AGG CGA GGA GAA TTT TTT 2208 
Leu Ala Thr Asp Leu Ala Leu Tyr He Lys Arg Arg Gly Glu Phe Phe 
725 730 735 



2 Y& 



GAA CTT ATA 
Giu Leu He 



GAG TTG TTT 
Glu Leu Phe 
755 

ACA AAA CCC 
Thr Lys Pro 
770 

GAA TTT TTT 
Glu Phe Phe 
785 

CCC ACT GAT 
Pro Thr Asp 



CAA GTT GGG 
Gin Val Gly 



ACC CAC GTG 
Thr His Val 
835 

AAC AGG CAG 
Asn Arg Gin 
850 

ATT AAT GGG 
He Asn Gly 
865 

CGG GAT CCA 
Arg Asp Pro 

ACC GGG GTG 
Thr Gly Val 

CAC AAG TTC 
His Lys Phe 
915 

AAG CTG ACC 
Lys Leu Thr 
930 

TGG CCC ACC 
Trp Pro Thr 



AGA AAA AAT 
Arg Lys Asn 
740 

TTG GCA ATG 
Leu Ala Met 



TGG CCT ATT 
Trp Pro He 

GAT CAA GGA 
Asp Gin Gly 
790 

CTA ATG AAC 
Leu Met Asn 
805 

TTC ATA GAT 
Phe He Asp 
820 

TCA GAG GAC 
Ser Glu Asp 

AAA TGG CAG 
Lys Trp Gin 

GAA AGC GGC 
Glu Ser Gly 
870 

CCG GTC GCC 
Pro Val Ala 
885 

GTG CCC ATC 
Val Pro lie 
900 

AGC GTG TCC 
Ser Val Ser 



CTG AAG TTC 
Leu Lys Phe 

CTC GTG ACC 
Leu Val Thr 



CAA TTC AAT 
Gin Phe Asn 
745 

CTG ATG ACA 
Leu Met Thr 
760 

CAA CAA CGG 
Gin Gin Arg 
775 

GAC AGA GAG 
Asp Arg Glu 

AGG GAG AAG 
Arg Glu Lys 

GCC ATC TGC 
Ala He Cys 
825 

TGT TTC CCT 
Cys Phe Pro 
840 

GCC CTT GCA 
Ala Leu Ala 
855 

CAG GCC AAG 
Gin Ala Lys 



ACC ATG GTG 
Thr Met Val 



CTG GTC GAG 
Leu Val Glu 
905 

GGC GAG GGC 
Gly Glu Gly 
920 

ATC TGC ACC 
He Cys Thr 
935 

ACC CTG ACC 
Thr Leu Thr 



TTG GAA GAT 
Leu Glu Asp 

GCT TGT GAT 
Ala Cys Asp 

ATA GCA GAA 
He Ala Glu 
780 

AGA AAA GAA 
Arg Lys Glu 
795 

AAA AAC AAA 
Lys Asn Lys 
810 

TTG CAA CTG 
Leu Gin Leu 

TTG CTA GAT 
Leu Leu Asp 

GAA CAG CAG 
Glu Gin Gin 
860 

CGG AAC TGG 
Arg Asn Trp 
875 

AGC AAG GGC 
Ser Lys Gly 
890 

CTG GAC GGC 
Leu Asp Gly 

GAG GGC GAT 
Glu Gly Asp 



ACC GGC AAG 
Thr Gly Lys 
940 

TAC GGC GTG 
Tyr Gly Val 



CCT CAT CAA 
Pro His Gin 
750 

CTT TCT GCA 
Leu Ser Ala 
765 

CTT GTA GCA 
Leu Val Ala 



CTC AAC ATA 
Leu Asn He 



ATC CCA AGT 
He Pro Ser 
815 

TAT GAG GCC 
Tyr Glu Ala 
830 

GGC TGC AGA 
Gly Cys Arg 
845 

GAG AAG ATG 
Glu Lys Met 

GTA CCG CGG 
Val Pro Arg 

GAG GAG CTG 
Glu Glu Leu 
895 

GAC GTA AAC 
Asp Val Asn 
910 

GCC ACC TAC 
Ala Thr Tyr 
925 

CTG CCC GTG 
Leu Pro Val 



CAG TGC TTC 
Gin Cys Phe 



AAG 2256 
Lys 

ATT 2 3 04 
He 



ACT 2 3 52 
Thr 



GAA 2400 

Glu 

800 

ATG 2448 
Met 



CTG 2496 
Leu 



AAG 2544 
Lys 

CTG 2592 
Leu 



GCC 2640 

Ala 

880 

TTC 2688 
Phe 



GGC 2736 
Gly 



GGC 2784 
Gly 



CCC 2832 
Pro 



AGC 2880 
Ser 



945 950 955 960 

CGC TAC CCC GAC CAC ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC ATG 2928 
Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met 
965 970 975 

CCC GAA GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC GGC 2976 
Pro Glu Gly Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly 
980 985 990 

AAC TAC AAG ACC CGC GCC GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG 3 024 
Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val 
995 1000 1005 

AAC CGC ATC GAG CTG AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC 3072 
Asn Arg He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He 
1010 1015 1020 

CTG GGG CAC AAG CTG GAG TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC 3120 
Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He 
1025 1030 1035 1040 

ATG GCC GAC AAG CAG AAG AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC 3168 
Met Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg 
1045 1050 1055 

CAC AAC ATC GAG GAC GGC AGC GTG CAG CTC GCC GAC CAC TAC CAG CAG 3216 
His Asn lie Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin 
1060 1065 1070 

AAC ACC CCC ATC GGC GAC GGC CCC GTG CTG CTG CCC GAC AAC CAC TAC 32 64 
Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr 
1075 1080 1085 

CTG AGC ACC CAG TCC GCC CTG AGC AAA GAC CCC AAC GAG AAG CGC GAT 3312 
Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp 
1090 1095 1100 

CAC ATG GTC CTG CTG GAG TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC 33 60 
His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly 
1105 1110 1115 1120 

ATG GAC GAG CTG TAC AAG TAA 3 3 81 

Met Asp Glu Leu Tyr Lys 
1125 



(2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 1126 amino acids 
(B) TYPE: amino acid 
{ C ) STRANDEDNES S : s ing 1 e 
( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 

Met Glu Arg Ala Gly Pro Ser Phe Gly Gin Gin Arg Gin Gin Gin Gin 

15 10 15 

Pro Gin Gin Gin Lys Gin Gin Gin Arg Asp Gin Asp Ser Val Glu Ala 

20 25 30 

Trp Leu Asp Asp His Trp Asp Phe Thr Phe Ser Tyr Phe Val Arg Lys 

35 40 45 

Ala Thr Arg Glu Met Val Asn Ala Trp Phe Ala Glu Arg Val His Thr 

50 55 60 

He Pro Val Cys Lys Glu Gly He Arg Gly His Thr Glu Ser Cys Ser 
65 70 75 80 

Cys Pro Leu Gin Gin Ser Pro Arg Ala Asp Asn Ser Val Pro Gly Thr 

85 90 95 

Pro Thr Arg Lys He Ser Ala Ser Glu Phe Asp Arg Pro Leu Arg Pro 

100 105 HO 

He Val Val Lys Asp Ser Glu Gly Thr Val Ser Phe Leu Ser Asp Ser 

115 120 125 

Glu Lys Lys Glu Gin Met Pro Leu Thr Pro Pro Arg Phe Asp His Asp 

130 135 140 

Glu Gly Asp Gin Cys Ser Arg Leu Leu Glu Leu Val Lys Asp He Ser 
145 150 155 160 

Ser His Leu Asp Val Thr Ala Leu Cys His Lys He Phe Leu His He 

165 170 175 

His Gly Leu He Ser Ala Asp Arg Tyr Ser Leu Phe Leu Val Cys Glu 

180 185 190 

Asp Ser Ser Asn Asp Lys Phe Leu He Ser Arg Leu Phe Asp Val Ala 

195 200 205 

Glu Gly Ser Thr Leu Glu Glu Val Ser Asn Asn Cys lie Arg Leu Glu 

210 215 220 

Trp Asn Lys Gly lie Val Gly His Val Ala Ala Leu Gly Glu Pro Leu 
225 230 235 240 

Asn lie Lys Asp Ala Tyr Glu Asp Pro Arg Phe Asn Ala Glu Val Asp 

245 250 255 

Gin lie Thr Gly Tyr Lys Thr Gin Ser lie Leu Cys Met Pro lie Lys 

260 265 270 

Asn His Arg Glu Glu Val Val Gly Val Ala Gin Ala lie Asn Lys Lys 

275 280 285 

Ser Gly Asn Gly Gly Thr Phe Thr Glu Lys Asp Glu Lys Asp Phe Ala 

290 295 300 

Ala Tyr Leu Ala Phe Cys Gly lie Val Leu His Asn Ala Gin Leu Tyr 
305 310 315 320 

Glu Thr Ser Leu Leu Glu Asn Lys Arg Asn Gin Val Leu Leu Asp Leu 

325 330 335 

Ala Ser Leu He Phe Glu Glu Gin Gin Ser Leu Glu Val He Leu Lys 

340 345 350 

Lys lie Ala Ala Thr lie lie Ser Phe Met Gin Val Gin Lys Cys Thr 

355 360 365 

lie Phe lie Val Asp Glu Asp Cys Ser Asp Ser Phe Ser Ser Val Phe 
370 375 380 
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His Met Glu Cys Glu Glu Leu Glu Lys Ser Ser Asp Thr Leu Thr Arg 
385 390 395 400 

Glu His Asp Ala Asn Lys lie Asn Tyr Met Tyr Ala Gin Tyr Val Lys 

405 410 415 

Asn Thr Met Glu Pro Leu Asn lie Pro Asp Val Ser Lys Asp Lys Arg 

420 425 430 

Phe Pro Trp Thr Thr Glu Asn Thr Gly Asn Val Asn Gin Gin Cys lie 

435 440 445 

Arg Ser Leu Leu Cys Thr Pro lie Lys Asn Gly Lys Lys Asn Lys Val 

450 455 460 

lie Gly Val Cys Gin Leu Val Asn Lys Met Glu Glu Asn Thr Gly Lys 
465 470 475 480 

Val Lys Pro Phe Asn Arg Asn Asp Glu Gin Phe Leu Glu Ala Phe Val 

485 490 495 

lie Phe Cys Gly Leu Gly lie Gin Asn Thr Gin Met Tyr Glu Ala Val 

500 505 510 

Glu Arg Ala Met Ala Lys Gin Met Val Thr Leu Glu Val Leu Ser Tyr 

515 520 525 

His Ala Ser Ala Ala Glu Glu Glu Thr Arg Glu Leu Gin Ser Leu Ala 

530 535 540 

Ala Ala Val Val Pro Ser Ala Gin Thr Leu Lys lie Thr Asp Phe Ser 
545 550 555 560 

Phe Ser Asp Phe Glu Leu Ser Asp Leu Glu Thr Ala Leu Cys Thr lie 

565 570 575 

Arg Met Phe Thr Asp Leu Asn Leu Val Gin Asn Phe Gin Met Lys His 

580 585 590 

Glu Val Leu Cys Arg Trp lie Leu Ser Val Lys Lys Asn Tyr Arg Lys 

595 600 605 

Asn Val Ala Tyr His Asn Trp Arg His Ala Phe Asn Thr Ala Gin Cys 

610 615 620 

Met Phe Ala Ala Leu Lys Ala Gly Lys lie Gin Asn Lys Leu Thr Asp 
625 630 635 640 

Leu Glu lie Leu Ala Leu Leu lie Ala Ala Leu Ser His Asp Leu Asp 

645 650 655 

His Arg Gly Val Asn Asn Ser Tyr lie Gin Arg Ser Glu His Pro Leu 

660 665 670 

Ala Gin Leu Tyr Cys His Ser lie Met Glu His His His Phe Asp Gin 

675 680 685 

Cys Leu Met lie Leu Asn Ser Pro Gly Asn Gin lie Leu Ser Gly Leu 

690 695 700 

Ser lie Glu Glu Tyr Lys Thr Thr Leu Lys lie lie Lys Gin Ala lie 
705 710 715 720 

Leu Ala Thr Asp Leu Ala Leu Tyr He Lys Arg Arg Gly Glu Phe Phe 

725 730 735 

Glu Leu He Arg Lys Asn Gin Phe Asn Leu Glu Asp Pro His Gin Lys 

740 745 750 

Glu Leu Phe Leu Ala Met Leu Met Thr Ala Cys Asp Leu Ser Ala He 

755 760 765 

Thr Lys Pro Trp Pro He Gin Gin Arg He Ala Glu Leu Val Ala Thr 

770 775 780 

Glu Phe Phe Asp Gin Gly Asp Arg Glu Arg Lys Glu Leu Asn He Glu 
785 790 795 800 

Pro Thr Asp Leu Met Asn Arg Glu Lys Lys Asn Lys He Pro Ser Met 
805 810 815 
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Gin Val Gly Phe lie Asp Ala He Cys Leu Gin Leu Tyr Glu Ala Leu 

820 825 830 

Thr His Val Ser Glu Asp Cys Phe Pro Leu Leu Asp Gly Cys Arg Lys 

835 840 845 

Asn Arg Gin Lys Trp Gin Ala Leu Ala Glu Gin Gin Glu Lys Met Leu 

850 855 860 

He Asn Gly Glu Ser Gly Gin Ala Lys Arg Asn Trp Val Pro Arg Ala 
865 870 875 880 

Arg Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe 

885 890 895 

Thr Gly Val Val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly 

900 905 910 

His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly 

915 920 925 

Lys Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro 

930 935 940 

Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser 
945 950 955 960 

Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met 

965 970 975 

Pro Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly 

980 985 990 

Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val 

995 1000 1005 

Asn Arg He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He 

1010 1015 1020 

Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He 
025 1030 1035 1040 

Met Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg 

1045 1050 1055 

His Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin 

1060 1065 1070 

Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr 

1075 1080 1085 

Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp 

1090 1095 1100 

His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly 
105 1110 1115 1120 

Met Asp Glu Leu Tyr Lys 
1125 

(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2760 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 



(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 1 . . .2757 
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(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 

ATG GCT GAC CCG GCT GCG GGG CCG CCG CCG AGC GAG GGC GAG GAG AGC 48 
Met Ala Asp Pro Ala Ala Gly Pro Pro Pro Ser Glu Gly Glu Glu Ser 
15 10 15 

ACC GTG CGC TTC GCC CGC AAA GGC GCC CTC CGG CAG AAG AAC GTG CAT 96 
Thr Val Arg Phe Ala Arg Lys Gly Ala Leu Arg Gin Lys Asn Val His 
20 25 30 

GAG GTC AAG AAC CAC AAA TTC ACC GCC CGC TTC TTC AAG CAG CCC ACC 144 
Glu Val Lys Asn His Lys Phe Thr Ala Arg Phe Phe Lys Gin Pro Thr 
35 40 45 

TTC TGC AGC CAC TGC ACC GAC TTC ATC TGG GGC TTC GGG AAG CAG GGA 192 
Phe Cys Ser His Cys Thr Asp Phe lie Trp Gly Phe Gly Lys Gin Gly 
50 55 60 

TTC CAG TGC CAA GTT TGC TGC TTT GTG GTG CAC AAG CGG TGC CAT GAA 240 
Phe Gin Cys Gin Val Cys Cys Phe Val Val His Lys Arg Cys His Glu 
65 70 75 80 

TTT GTC ACA TTC TCC TGC CCT GGC GCT GAC AAG GGT CCA GCC TCC GAT 288 
Phe Val Thr Phe Ser Cys Pro Gly Ala Asp Lys Gly Pro Ala Ser Asp 
85 90 95 

GAC CCC CGC AGC AAA CAC AAG TTT AAG ATC CAC ACG TAC TCC AGC CCC 336 
Asp Pro Arg Ser Lys His Lys Phe Lys He His Thr Tyr Ser Ser Pro 
100 105 HO 

ACG TTT TGT GAC CAC TGT GGG TCA CTG CTG TAT GGA CTC ATC CAC CAG 3 84 

Thr Phe Cys Asp His Cys Gly Ser Leu Leu Tyr Gly Leu He His Gin 
115 120 125 

GGG ATG AAA TGT GAC ACC TGC ATG ATG AAT GTG CAC AAG CGC TGC GTG 432 
Gly Met Lys Cys Asp Thr Cys Met Met Asn Val His Lys Arg Cys Val 
130 135 140 

ATG AAT GTT CCC AGC CTG TGT GGC ACG GAC CAC ACG GAG CGC CGC GGC 480 
Met Asn Val Pro Ser Leu Cys Gly Thr Asp His Thr Glu Arg Arg Gly 
145 150 155 160 

CGC ATC TAC ATC CAG GCC CAC ATC GAC AGG GAC GTC CTC ATT GTC CTC 52 8 

Arg He Tyr He Gin Ala His He Asp Arg Asp Val Leu lie Val Leu 
165 170 175 

GTA AGA GAT GCT AAA AAC CTT GTA CCT ATG GAC CCC AAT GGC CTG TCA 57 6 

Val Arg Asp Ala Lys Asn Leu Val Pro Met Asp Pro Asn Gly Leu Ser 
180 185 190 

GAT CCC TAC GTA AAA CTG AAA CTG ATT CCC GAT CCC AAA AGT GAG AGC 624 
Asp Pro Tyr Val Lys Leu Lys Leu He Pro Asp Pro Lys Ser Glu Ser 



195 200 205 

AAA CAG AAG ACC AAA ACC ATC AAA TGC TCC CTC AAC CCT GAG TGG AAT 672 
Lys Gin Lys Thr Lys Thr lie Lys Cys Ser Leu Asn Pro Glu Trp Asn 
210 215 220 

GAG ACA TTT AGA TTT CAG CTG AAA GAA TCG GAC AAA GAC AGA AGA CTG 720 
Glu Thr Phe Arg Phe Gin Leu Lys Glu Ser Asp Lys Asp Arg Arg Leu 
225 230 235 240 

TCA GTA GAG ATT TGG GAT TGG GAT TTG ACC AGC AGG AAT GAC TTC ATG 768 
Ser Val Glu He Trp Asp Trp Asp Leu Thr Ser Arg Asn Asp Phe Met 
245 250 255 

GGA TCT TTG TCC TTT GGG ATT TCT GAA CTT CAG AAG GCC AGT GTT GAT 816 
Gly Ser Leu Ser Phe Gly He Ser Glu Leu Gin Lys Ala Ser Val Asp 
260 265 270 

GGC TGG TTT AAG TTA CTG AGC CAG GAG GAA GGC GAG TAC TTC AAT GTG 864 
Gly Trp Phe Lys Leu Leu Ser Gin Glu Glu Gly Glu Tyr Phe Asn Val 
275 280 285 

CCT GTG CCA CCA GAA GGA AGT GAG GCC AAT GAA GAA CTG CGG CAG AAA 912 
Pro Val Pro Pro Glu Gly Ser Glu Ala Asn Glu Glu Leu Arg Gin Lys 
290 295 300 

TTT GAG AGG GCC AAG ATC AGT CAG GGA ACC AAG GTC CCG GAA GAA AAG 960 
Phe Glu Arg Ala Lys He Ser Gin Gly Thr Lys Val Pro Glu Glu Lys 
305 310 315 320 

ACG ACC AAC ACT GTC TCC AAA TTT GAC AAC AAT GGC AAC AGA GAC CGG 1008 
Thr Thr Asn Thr Val Ser Lys Phe Asp Asn Asn Gly Asn Arg Asp Arg 
325 330 335 

ATG AAA CTG ACC GAT TTT AAC TTC CTA ATG GTG CTG GGG AAA GGC AGC 1056 
Met Lys Leu Thr Asp Phe Asn Phe Leu Met Val Leu Gly Lys Gly Ser 
340 345 350 

TTT GGC AAG GTC ATG CTT TCA GAA CGA AAA GGC ACA GAT GAG CTC TAT 1104 
Phe Gly Lys Val Met Leu Ser Glu Arg Lys Gly Thr Asp Glu Leu Tyr 
355 360 365 

GCT GTG AAG ATC CTG AAG AAG GAC GTT GTG ATC CAA GAT GAT GAC GTG 1152 
Ala Val Lys He Leu Lys Lys Asp Val Val He Gin Asp Asp Asp Val 
370 375 380 

GAG TGC ACT ATG GTG GAG AAG CGG GTG TTG GCC CTG CCT GGG AAG CCG 1200 
Glu Cys Thr Met Val Glu Lys Arg Val Leu Ala Leu Pro Gly Lys Pro 
385 390 395 400 

CCC TTC CTG ACC CAG CTC CAC TCC TGC TTC CAG ACC ATG GAC CGC CTG 1248 
Pro Phe Leu Thr Gin Leu His Ser Cys Phe Gin Thr Met Asp Arg Leu 
405 410 415 
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TAC TTT GTG ATG GAG TAC GTG AAT GGG GGC GAC CTC ATG TAT CAC ATC 12 96 

Tyr Phe Val Met Glu Tyr Val Asn Gly Gly Asp Leu Met Tyr His He 
420 425 430 

CAG CAA GTC GGC CGG TTC AAG GAG CCC CAT GCT GTA TTT TAC GCT GCA 1344 
Gin Gin Val Gly Arg Phe Lys Glu Pro His Ala Val Phe Tyr Ala Ala 
435 440 445 

GAA ATT GCC ATC GGT CTG TTC TTC TTA CAG AGT AAG GGC ATC ATT TAC 13 92 
Glu He Ala He Gly Leu Phe Phe Leu Gin Ser Lys Gly He He Tyr 
450 455 460 

CGT GAC CTA AAA CTT GAC AAC GTG ATG CTC GAT TCT GAG GGA CAC ATC 1440 
Arg Asp Leu Lys Leu Asp Asn Val Met Leu Asp Ser Glu Gly His He 
465 470 475 480 

AAG ATT GCC GAT TTT GGC ATG TGT AAG GAA AAC ATC TGG GAT GGG GTG 1488 
Lys He Ala Asp Phe Gly Met Cys Lys Glu Asn He Trp Asp Gly Val 
485 490 495 

ACA ACC AAG ACA TTC TGT GGC ACT CCA GAC TAC ATC GCC CCC GAG ATA 1536 
Thr Thr Lys Thr Phe Cys Gly Thr Pro Asp Tyr He Ala Pro Glu He 
500 505 510 

ATT GCT TAT CAG CCC TAT GGG AAG TCC GTG GAT TGG TGG GCA TTT GGA 15 84 
He Ala Tyr Gin Pro Tyr Gly Lys Ser Val Asp Trp Trp Ala Phe Gly 
515 520 525 

GTC CTG CTG TAT GAA ATG TTG GCT GGG CAG GCA CCC TTT GAA GGG GAG 1632 
Val Leu Leu Tyr Glu Met Leu Ala Gly Gin Ala Pro Phe Glu Gly Glu 
530 535 540 

GAT GAA GAT GAA CTC TTC CAA TCC ATC ATG GAA CAC AAC GTA GCC TAT 16 80 

Asp Glu Asp Glu Leu Phe Gin Ser He Met Glu His Asn Val Ala Tyr 
545 550 555 560 

CCC AAG TCT ATG TCC AAG GAA GCT GTG GCC ATC TGC AAA GGG CTG ATG 17 28 
Pro Lys Ser Met Ser Lys Glu Ala Val Ala lie Cys Lys Gly Leu Met 
565 570 575 

ACC AAA CAC CCA GGC AAA CGT CTG GGT TGT GGA CCT GAA GGC GAA CGT 1776 
Thr Lys His Pro Gly Lys Arg Leu Gly Cys Gly Pro Glu Gly Glu Arg 
580 585 590 

GAT ATC AAA GAG CAT GCA TTT TTC CGG TAT ATT GAT TGG GAG AAA CTT 1824 
Asp He Lys Glu His Ala Phe Phe Arg Tyr He Asp Trp Glu Lys Leu 
595 600 605 

GAA CGC AAA GAG ATC CAG CCC CCT TAT AAG CCA AAA GCT TGT GGG CGA 1872 
Glu Arg Lys Glu He Gin Pro Pro Tyr Lys Pro Lys Ala Cys Gly Arg 
610 615 620 

AAT GCT GAA AAC TTC GAC CGA TTT TTC ACC CGC CAT CCA CCA GTC CTA 1920 
Asn Ala Glu Asn Phe Asp Arg Phe Phe Thr Arg His Pro Pro Val Leu 
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625 630 635 640 

ACA CCT CCC GAC CAG GAA GTC ATC AGG AAT ATT GAC CAA TCA GAA TTC 1968 

Thr Pro Pro Asp Gin Glu Val lie Arg Asn lie Asp Gin Ser Glu Phe 
645 650 655 

GAA GGA TTT TCC TTT GTT AAC TCT GAA TTT TTA AAA CCC GAA GTC AAG 2016 

Glu Gly Phe Ser Phe Val Asn Ser Glu Phe Leu Lys Pro Glu Val Lys 
660 665 670 

AGC TCG GAT CCA CCG GTC GCC ACC ATG GTG AGC AAG GGC GAG GAG CTG 2064 

Ser Ser Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly Glu Glu Leu 
675 680 685 

TTC ACC GGG GTG GTG CCC ATC CTG GTC GAG CTG GAC GGC GAC GTA AAC 2112 

Phe Thr Gly Val Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn 

690 695 700 

GGC CAC AAG TTC AGC GTG TCC GGC GAG GGC GAG GGC GAT GCC ACC TAC 2160 

Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr 
705 710 715 720 

GGC AAG CTG ACC CTG AAG TTC ATC TGC ACC ACC GGC AAG CTG CCC GTG 2208 

Gly Lys Leu Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val 
725 730 735 

CCC TGG CCC ACC CTC GTG ACC ACC CTG ACC TAC GGC GTG CAG TGC TTC 22 56 

Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gin Cys Phe 
740 745 750 

AGC CGC TAC CCC GAC CAC ATG AAG CAG CAC GAC TTC TTC AAG TCC GCC 2 3 04 

Ser Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala 
755 760 765 

ATG CCC GAA GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC AAG GAC GAC 23 52 

Met Pro Glu Gly Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp 

770 775 780 

GGC AAC TAC AAG ACC CGC GCC GAG GTG AAG TTC GAG GGC GAC ACC CTG 2400 

Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu 
785 790 795 800 

GTG AAC CGC ATC GAG CTG AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC 2448 

Val Asn Arg lie Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn 
805 810 815 

ATC CTG GGG CAC AAG CTG GAG TAC AAC TAC AAC AGC CAC AAC GTC TAT 24 96 

lie Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr 
820 825 830 

ATC ATG GCC GAC AAG CAG AAG AAC GGC ATC AAG GTG AAC TTC AAG ATC 2544 

lie Met Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He 
835 840 845 
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CGC CAC AAC ATC GAG GAC GGC AGC GTG 
Arg His Asn lie Glu Asp Gly Ser Val 
850 855 

CAG AAC ACC CCC ATC GGC GAC GGC CCC 
Gin Asn Thr Pro lie Gly Asp Gly Pro 
865 870 

TAC CTG AGC ACC CAG TCC GCC CTG AGC 

iyr Leu Ser Thr Gin Ser Ala Leu Ser 
885 

GAT CAC ATG GTC CTG CTG GAG TTC GTG 

Asp His Met Val Leu Leu Glu Phe Val 

900 905 

GGC ATG GAC GAG CTG TAC AAG TAA 
Gly Met Asp Glu Leu Tyr Lys 
915 



CAG CTC GCC GAC CAC TAC CAG 2 592 

Gin Leu Ala Asp His Tyr Gin 
860 

GTG CTG CTG CCC GAC AAC CAC 2640 

Val Leu Leu Pro Asp Asn His 
875 880 

AAA GAC CCC AAC GAG AAG CGC 2688 

Lys Asp Pro Asn Glu Lys Arg 

890 895 

ACC GCC GCC GGG ATC ACT CTC 2736 

Thr Ala Ala Gly He Thr Leu 
910 

2760 



(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 919 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

Met Ala Asp Pro Ala Ala Gly Pro Pro Pro Ser Glu Gly Glu Glu Ser 

15 10 15 

Thr Val Arg Phe Ala Arg Lys Gly Ala Leu Arg Gin Lys Asn Val His 

20 25 30 

Glu Val Lys Asn His Lys Phe Thr Ala Arg Phe Phe Lys Gin Pro Thr 

35 40 45 

Phe Cys Ser His Cys Thr Asp Phe He Trp Gly Phe Gly Lys Gin Gly 

50 55 60 

Phe Gin Cys Gin Val Cys Cys Phe Val Val His Lys Arg Cys His Glu 
65 70 75 80 

Phe Val Thr Phe Ser Cys Pro Gly Ala Asp Lys Gly Pro Ala Ser Asp 

85 90 95 

Asp Pro Arg Ser Lys His Lys Phe Lys He His Thr Tyr Ser Ser Pro 

100 105 110 

Thr Phe Cys Asp His Cys Gly Ser Leu Leu Tyr Gly Leu He His Gin 

115 120 125 

Gly Met Lys Cys Asp Thr Cys Met Met Asn Val His Lys Arg Cys Val 

130 135 140 

Met Asn Val Pro Ser Leu Cys Gly Thr Asp His Thr Glu Arg Arg Gly 
145 150 155 160 



Arg lie Tyr He Gin Ala His He Asp Arg Asp Val Leu He Val Leu 

165 170 175 

Val Arg Asp Ala Lys Asn Leu Val Pro Met Asp Pro Asn Gly Leu Ser 

180 185 190 

Asp Pro Tyr Val Lys Leu Lys Leu He Pro Asp Pro Lys Ser Glu Ser 

195 200 205 

Lys Gin Lys Thr Lys Thr He Lys Cys Ser Leu Asn Pro Glu Trp Asn 

210 215 220 

Glu Thr Phe Arg Phe Gin Leu Lys Glu Ser Asp Lys Asp Arg Arg Leu 
225 230 235 240 

Ser Val Glu lie Trp Asp Trp Asp Leu Thr Ser Arg Asn Asp Phe Met 

245 250 255 

Gly Ser Leu Ser Phe Gly He Ser Glu Leu Gin Lys Ala Ser Val Asp 

260 265 270 

Gly Trp Phe Lys Leu Leu Ser Gin Glu Glu Gly Glu Tyr Phe Asn Val 

275 280 285 

Pro Val Pro Pro Glu Gly Ser Glu Ala Asn Glu Glu Leu Arg Gin Lys 

290 295 300 

Phe Glu Arg Ala Lys lie Ser Gin Gly Thr Lys Val Pro Glu Glu Lys 
305 310 315 320 

Thr Thr Asn Thr Val Ser Lys Phe Asp Asn Asn Gly Asn Arg Asp Arg 

325 330 335 

Met Lys Leu Thr Asp Phe Asn Phe Leu Met Val Leu Gly Lys Gly Ser 

340 345 350 

Phe Gly Lys Val Met Leu Ser Glu Arg Lys Gly Thr Asp Glu Leu Tyr 

355 360 365 

Ala Val Lys lie Leu Lys Lys Asp Val Val He Gin Asp Asp Asp Val 

370 375 380 

Glu Cys Thr Met Val Glu Lys Arg Val Leu Ala Leu Pro Gly Lys Pro 
385 390 395 400 

Pro Phe Leu Thr Gin Leu His Ser Cys Phe Gin Thr Met Asp Arg Leu 

405 410 415 

Tyr Phe Val Met Glu Tyr Val Asn Gly Gly Asp Leu Met Tyr His He 

420 425 430 

Gin Gin Val Gly Arg Phe Lys Glu Pro His Ala Val Phe Tyr Ala Ala 

435 440 445 

Glu He Ala lie Gly Leu Phe Phe Leu Gin Ser Lys Gly He He Tyr 

450 455 460 

Arg Asp Leu Lys Leu Asp Asn Val Met Leu Asp Ser Glu Gly His He 
465 470 475 480 

Lys He Ala Asp Phe Gly Met Cys Lys Glu Asn lie Trp Asp Gly Val 

485 490 495 

Thr Thr Lys Thr Phe Cys Gly Thr Pro Asp Tyr lie Ala Pro Glu lie 

500 505 510 

lie Ala Tyr Gin Pro Tyr Gly Lys Ser Val Asp Trp Trp Ala Phe Gly 

515 520 525 

Val Leu Leu Tyr Glu Met Leu Ala Gly Gin Ala Pro Phe Glu Gly Glu 

530 535 540 

Asp Glu Asp Glu Leu Phe Gin Ser lie Met Glu His Asn Val Ala Tyr 
545 550 555 560 

Pro Lys Ser Met Ser Lys Glu Ala Val Ala He Cys Lys Gly Leu Met 

565 570 575 

Thr Lys His Pro Gly Lys Arg Leu Gly Cys Gly Pro Glu Gly Glu Arg 
580 585 590 
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Asp He Lys Glu His 
595 

Glu Arg Lys Glu He 
610 

Asn Ala Glu Asn Phe 
625 

Thr Pro Pro Asp Gin 
645 

Glu Gly Phe Ser Phe 
660 

Ser Ser Asp Pro Pro 
675 

Phe Thr Gly Val Val 
690 

Gly His Lys Phe Ser 
705 

Gly Lys Leu Thr Leu 
725 

Pro Trp Pro Thr Leu 
740 

Ser Arg Tyr Pro Asp 
755 

Met Pro Glu Gly Tyr 
770 

Gly Asn Tyr Lys Thr 
785 

Val Asn Arg He Glu 
805 

He Leu Gly His Lys 
820 

He Met Ala Asp Lys 
835 

Arg His Asn He Glu 
850 

Gin Asn Thr Pro He 
865 

Tyr Leu Ser Thr Gin 
885 

Asp His Met Val Leu 

900 

Gly Met Asp Glu Leu 
915 



Ala Phe Phe Arg Tyr He 
600 

Gin Pro Pro Tyr Lys Pro 
615 

Asp Arg Phe Phe Thr Arg 
630 635 
Glu Val He Arg Asn He 
650 

Val Asn Ser Glu Phe Leu 
665 

Val Ala Thr Met Val Ser 
680 

Pro He Leu Val Glu Leu 
695 

Val Ser Gly Glu Gly Glu 
710 715 
Lys Phe He Cys Thr Thr 
730 

Val Thr Thr Leu Thr Tyr 
745 

His Met Lys Gin His Asp 
760 

Val Gin Glu Arg Thr He 

775 

Arg Ala Glu Val Lys Phe 
790 795 
Leu Lys Gly He Asp Phe 
810 

Leu Glu Tyr Asn Tyr Asn 
825 

Gin Lys Asn Gly He Lys 
840 

Asp Gly Ser Val Gin Leu 
855 

Gly Asp Gly Pro Val Leu 
870 875 
Ser Ala Leu Ser Lys Asp 
890 

Leu Glu Phe Val Thr Ala 
905 

Tyr Lys 



Asp Trp Glu Lys Leu 
605 

Lys Ala Cys Gly Arg 
620 

His Pro Pro Val Leu 
640 

Asp Gin Ser Glu Phe 
655 

Lys Pro Glu Val Lys 
670 

Lys Gly Glu Glu Leu 
685 

Asp Gly Asp Val Asn 
700 

Gly Asp Ala Thr Tyr 
720 

Gly Lys Leu Pro Val 
735 

Gly Val Gin Cys Phe 
750 

Phe Phe Lys Ser Ala 
765 

Phe Phe Lys Asp Asp 
780 

Glu Gly Asp Thr Leu 
800 

Lys Glu Asp Gly Asn 
815 

Ser His Asn Val Tyr 
830 

Val Asn Phe Lys He 
845 

Ala Asp His Tyr Gin 
860 

Leu Pro Asp Asn His 
880 

Pro Asn Glu Lys Arg 
895 

Ala Gly He Thr Leu 
910 



(2) INFORMATION FOR SEQ ID NO: 148: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3009 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 



2$ 2- 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. . .3006 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

ATG GCT CAG CAG ACA AGC CCG GAC ACT TTA ACA GTA CCT GAA GTG GAT 4 8 

Met Ala Gin Gin Thr Ser Pro Asp Thr Leu Thr Val Pro Glu Val Asp 
15 10 15 

AAT CCG CAT TGT CCA AAC CCG TGG CTG AAC GAA GAC CTT GTG AAA TCC 96 
Asn Pro His Cys Pro Asn Pro Trp Leu Asn Glu Asp Leu Val Lys Ser 
20 25 30 

TTG CGA GAA AAC CTG TTG CAG CAT GAG AAG TCC AAG ACA GCG AGG AAA 144 
Leu Arg Glu Asn Leu Leu Gin His Glu Lys Ser Lys Thr Ala Arg Lys 
35 40 45 

TCG GTT TCT CCC AAG CTC TCT CCA GTG ATC TCT CCG AGA AAT TCC CCC 192 
Ser Val Ser Pro Lys Leu Ser Pro Val lie Ser Pro Arg Asn Ser Pro 
50 55 60 

AGG CTT CTG CGC AGA ATG CTT CTC AGC AGC AAC ATC CCC AAA CAG CGG 240 
Arg Leu Leu Arg Arg Met Leu Leu Ser Ser Asn lie Pro Lys Gin Arg 
65 70 75 80 

CGT TTC ACG GTG GCA CAT ACA TGT TTT GAT GTG GAC AAT GGC ACA TCT 2 88 

Arg Phe Thr Val Ala His Thr Cys Phe Asp Val Asp Asn Gly Thr Ser 
85 90 95 

GCG GGA CGG AGT CCC TTG GAT CCC ATG ACC AGC CCA GGA TCC GGG CTA 336 
Ala Gly Arg Ser Pro Leu Asp Pro Met Thr Ser Pro Gly Ser Gly Leu 
100 105 110 

ATT CTC CAA GCA AAT TTT GTC CAC AGT CAA CGA CGG GAG TCC TTC CTG 3 84 

lie Leu Gin Ala Asn Phe Val His Ser Gin Arg Arg Glu Ser Phe Leu 
115 120 125 

TAT CGA TCC GAC AGC GAT TAT GAC CTC TCT CCA AAG TCT ATG TCC CGG 432 
Tyr Arg Ser Asp Ser Asp Tyr Asp Leu Ser Pro Lys Ser Met Ser Arg 
130 135 140 

AAC TCC TCC ATT GCC AGT GAT ATA CAC GGA GAT GAC TTG ATT GTG ACT 480 
Asn Ser Ser lie Ala Ser Asp lie His Gly Asp Asp Leu lie Val Thr 
145 150 155 160 

CCA TTT GCT CAG GTC TTG GCC AGT CTG CGA ACT GTA CGA AAC AAC TTT 528 
Pro Phe Ala Gin Val Leu Ala Ser Leu Arg Thr Val Arg Asn Asn Phe 
165 170 175 

GCT GCA TTA ACT AAT TTG CAA GAT CGA GCA CCT AGC AAA AGA TCA CCC 57 6 

Ala Ala Leu Thr Asn Leu Gin Asp Arg Ala Pro Ser Lys Arg Ser Pro 
180 185 190 
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ATG TGC AAC 
Met Cys Asn 
195 

TAC CAG AAA 
Tyr Gin Lys 
210 

GAC CAG CTA 

Asp Gin Leu 
225 

TCC AAC AAG 

Ser Asn Lys 

GAA ATG AGT 
Glu Met Ser 

TTC TTA GAT 
Phe Leu Asp 
275 

GAA AAG GAG 
Glu Lys Glu 
290 

AAA TTG ATG 
Lys Leu Met 
305 

GGA GTT AAA 
Gly Val Lys 



GTG AAC AAA 
Val Asn Lys 

AAC CGG CCC 
Asn Arg Pro 
355 

TTA TTA AAA 
Leu Leu Lys 
370 

ATG ACT CTC 
Met Thr Leu 
385 

ATC CAT GCT 
lie His Ala 



CAA CCA TCC 
Gin Pro Ser 



CTG GCC AGC 
Leu Ala Ser 



GAG ACC CTA 
Glu Thr Leu 
230 

TTT AAA AGG 
Phe Lys Arg 
245 

CGG TCT GGA 
Arg Ser Gly 
260 

AAG CAA CAT 
Lys Gin His 

AAA AAG AAA 
Lys Lys Lys 

CAC AGC TCT 
Kis Ser Ser 
310 

ACT GAA CAA 
Thr Glu Gin 
325 

TGG GGT CTT 
Trp Gly Leu 
340 

TTG ACT GTT 
Leu Thr Val 



ACA TTT AAA 
Thr Phe Lys 

GAA GAC CAT 
Glu Asp His 
390 

GCA GAT GTT 
Ala Asp Val 



ATC AAC AAA 
lie Asn Lys 
200 

GAG ACC CTG 
Glu Thr Leu 
215 

CAG ACC AGG 
Gin Thr Arg 

ATG CTT AAT 
Met Leu Asn 

AAT CAA GTG 
Asn Gin Val 
265 

GAA GTG GAA 
Glu Val Glu 

280 

AGA CCA ATG 
Arg Pro Met 
295 

AGT CTG ACT 
Ser Leu Thr 



GAA GAT GTC 
Glu Asp Val 



CAT GTT TTC 
His Val Phe 
345 

ATC ATG CAC 
He Met His 
360 

ATT CCA GTA 
He Pro Val 
375 

TAC CAT GCT 
Tyr His Ala 

GTC CAG TCT 
Val Gin Ser 



GCC ACC ATA 
Ala Thr He 



GAG GAG CTG 
Glu Glu Leu 
220 

CAC TCC GTC 
His Ser Val 
235 

CGG GAG CTC 
Arg Glu Leu 
250 

TCA GAG TTT 
Ser Glu Phe 



ATT CCT TCT 
He Pro Ser 



TCT CAG ATC 
Ser Gin He 
300 

AAT TCA AGT 
Asn Ser Ser 
315 

CTT GCC AAG 
Leu Ala Lys 
330 

AGA ATA GCA 
Arg He Ala 



ACC ATT TTT 
Thr He Phe 



GAT ACT TTA 
Asp Thr Leu 

380 

GAT GTG GCC 
Asp Val Ala 
395 

ACT CAT GTG 
Thr His Val 



ACA GAG GAG 
Thr Glu Glu 
205 

GAC TGG TGT 
Asp Trp Cys 

AGT GAG ATG 
Ser Glu Met 



ACC CAT CTC 
Thr His Leu 
255 

ATA TCA AAC 
He Ser Asn 
270 

CCA ACT CAG 
Pro Thr Gin 
285 

AGT GGA GTC 
Ser Gly Val 



ATC CCA AGG 
He Pro Arg 

GAA CTA GAA 
Glu Leu Glu 
335 

GAG TTG TCT 
Glu Leu Ser 
350 

CAG GAA CGG 
Gin Glu Arg 
365 

ATT ACA TAT 
He Thr Tyr 

TAT CAC AAC 
Tyr His Asn 

CTA TTA TCT 
Leu Leu Ser 



GCC 624 
Ala 



CTG 672 
Leu 



GCC 720 

Ala 

240 

TCT 768 
Ser 



ACA 816 
Thr 



AAG 864 
Lys 

AAG 912 
Lys 

TTT 960 

Phe 

320 

GAT 1008 
Asp 

GGT 1056 
Gly 

GAT 1104 
Asp 

CTT 1152 
Leu 

AAT 1200 

Asn 

400 

ACA 1248 
Thr 



405 410 415 

CCT GCT TTG GAG GOT GTG TIT AC A GAT TTG GAG ATT CTT GCA GCA ATT 1296 
Pro Ala Leu Glu Ala Val Phe Thr Asp Leu Glu He Leu Ala Ala He 
420 425 430 

TTT GCC AGT GCA ATA CAT GAT GTA GAT CAT CCT GGT GTG TCC AAT CAA 1344 
Phe Ala Ser Ala He His Asp Val Asp His Pro Gly Val Ser Asn Gin 
435 440 445 

TTT CTG ATC AAT ACA AAC TCT GAA CTT GCC TTG ATG TAC AAT GAT TCC 1392 
Phe Leu He Asn Thr Asn Ser Glu Leu Ala Leu Met Tyr Asn Asp Ser 
450 455 460 

TCA GTC TTA GAG AAC CAT CAT TTG GCT GTG GGC TTT AAA TTG CTT CAG 1440 
Ser Val Leu Glu Asn His His Leu Ala Val Gly Phe Lys Leu Leu Gin 
465 470 475 480 

GAA GAA AAC TGT GAC ATT TTC CAG AAT TTG ACC AAA AAA CAA AGA CAA 1488 
Glu Glu Asn Cys Asp He Phe Gin Asn Leu Thr Lys Lys Gin Arg Gin 
485 490 495 

TCT TTA AGG AAA ATG GTC ATT GAC ATC GTA CTT GCA ACA GAT ATG TCA 1536 
Ser Leu Arg Lys Met Val He Asp He Val Leu Ala Thr Asp Met Ser 
500 505 510 

AAA CAC ATG AAT CTA CTG GCT GAT TTG AAG ACT ATG GTT GAA ACT AAG 1584 
Lys His Met Asn Leu Leu Ala Asp Leu Lys Thr Met Val Glu Thr Lys 
515 520 525 

AAA GTG ACA AGC TCT GGA GTT CTT CTT CTT GAT AAT TAT TCC GAT AGG 1632 
Lys Val Thr Ser Ser Gly Val Leu Leu Leu Asp Asn Tyr Ser Asp Arg 
530 535 540 

ATT CAG GTT CTT CAG AAT ATG GTG CAC TGT GCA GAT CTG AGC AAC CCA 1680 
He Gin Val Leu Gin Asn Met Val His Cys Ala Asp Leu Ser Asn Pro 
545 550 555 560 

ACA AAG CCT CTC CAG CTG TAC CGC CAG TGG ACG GAC CGG ATA ATG GAG 1728 
Thr Lys Pro Leu Gin Leu Tyr Arg Gin Trp Thr Asp Arg lie Met Glu 
565 570 575 

GAG TTC TTC CGC CAA GGA GAC CGA GAG AGG GAA CGT GGC ATG GAG ATA 177 6 
Glu Phe Phe Arg Gin Gly Asp Arg Glu Arg Glu Arg Gly Met Glu He 
580 585 590 

AGC CCC ATG TGT GAC AAG CAC AAT GCT TCC GTG GAA AAA TCA CAG GTG 1824 
Ser Pro Met Cys Asp Lys His Asn Ala Ser Val Glu Lys Ser Gin Val 
595 600 605 

GGC TTC ATA GAC TAT ATT GTT CAT CCC CTC TGG GAG ACA TGG GCA GAC 1872 
Gly Phe He Asp Tyr He Val His Pro Leu Trp Glu Thr Trp Ala Asp 
610 615 620 
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CTC GTC CAC CCT GAC GCC CAG GAT ATT TTG GAC ACT TTG GAG GAC AAT 192 0 

Leu Val His Pro Asp Ala Gin Asp lie Leu Asp Thr Leu Glu Asp Asn 
625 630 635 640 

CGT GAA TGG TAC CAG AGC ACA ATC CCT CAG AGC CCC TCT CCT GCA CCT 1968 
Arg Glu Trp Tyr Gin Ser Thr lie Pro Gin Ser Pro Ser Pro Ala Pro 
645 650 655 

GAT GAC CCA GAG GAG GGC CGG CAG GGT CAA ACT GAG AAA TTC CAG TTT 2 016 
Asp Asp Pro Glu Glu Gly Arg Gin Gly Gin Thr Glu Lys Phe Gin Phe 
660 665 670 

GAA CTA ACT TTA GAG GAA GAT GGT GAG TCA GAC ACG GAA AAG GAC AGT 2064 
Glu Leu Thr Leu Glu Glu Asp Gly Glu Ser Asp Thr Glu Lys Asp Ser 
675 680 685 

GGC AGT CAA GTG GAA GAA GAC ACT AGC TGC AGT GAC TCC AAG ACT CTT 2112 
Gly Ser Gin Val Glu Glu Asp Thr Ser Cys Ser Asp Ser Lys Thr Leu 
690 695 700 

TGT ACT CAA GAC TCA GAG TCT ACT GAA ATT CCC CTT GAT GAA CAG GTT 2160 
Cys Thr Gin Asp Ser Glu Ser Thr Glu lie Pro Leu Asp Glu Gin Val 
705 710 715 720 

GAA GAG GAG GCA GTA GGG GAA GAA GAG GAA AGC CAG CCT GAA GCC TGT 2208 
Glu Glu Glu Ala Val Gly Glu Glu Glu Glu Ser Gin Pro Glu Ala Cys 
725 730 735 

GTC ATA GAT GAT CGT TCT CCT GAC ACG ACG GGA ATT CTG CAG TCG ACG 2256 
Val lie Asp Asp Arg Ser Pro Asp Thr Thr Gly He Leu Gin Ser Thr 
740 745 750 

GTA CCG CGG GCC CGG GAT CCA CCG GTC GCC ACC ATG GTG AGC AAG GGC 23 04 
Val Pro Arg Ala Arg Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly 
755 760 765 

GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG GTC GAG CTG GAC GGC 23 52 
Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu Leu Asp Gly 
770 775 780 

GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC GAG GGC GAG GGC GAT 2400 
Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp 
785 790 795 800 

GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC TGC ACC ACC GGC AAG 2448 
Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys 
805 810 815 

CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC CTG ACC TAC GGC GTG 2496 
Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val 
820 825 830 

CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG CAG CAC GAC TTC TTC 2544 
Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe 



835 840 845 

AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC 2 592 

Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr lie Phe Phe 
850 855 860 

AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG GTG AAG TTC GAG GGC 2 640 

Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly 

865 870 875 880 

GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC ATC GAC TTC AAG GAG 2 688 

Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie Asp Phe Lys Glu 
885 890 895 

GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC AAC TAC AAC AGC CAC 27 36 

Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His 
900 905 910 

AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC GGC ATC AAG GTG AAC 27 84 

Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn 
915 920 925 

TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC GTG CAG CTC GCC GAC 2832 

Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp 
930 935 940 

CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC CCC GTG CTG CTG CCC 2880 

His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro 

945 950 955 960 

GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG AGC AAA GAC CCC AAC 2 928 

Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn 
965 970 975 

GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC GTG ACC GCC GCC GGG 2976 

Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly 
980 985 990 

ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TAA 3 009 

He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
995 1000 



(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1002 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 

Met Ala Gin Gin Thr Ser Pro Asp Thr Leu Thr Val Pro Glu Val Asp 

15 10 15 

Asn Pro His Cys Pro Asn Pro Trp Leu Asn Glu Asp Leu Val Lys Ser 

20 25 30 

Leu Arg Glu Asn Leu Leu Gin His Glu Lys Ser Lys Thr Ala Arg Lys 

35 40 45 

Ser Val Ser Pro Lys Leu Ser Pro Val lie Ser Pro Arg Asn Ser Pro 

50 55 60 

Arg Leu Leu Arg Arg Met Leu Leu Ser Ser Asn He Pro Lys Gin Arg 
65 70 75 80 

Arg Phe Thr Val Ala His Thr Cys Phe Asp Val Asp Asn Gly Thr Ser 

85 90 95 

Ala Gly Arg Ser Pro Leu Asp Pro Met Thr Ser Pro Gly Ser Gly Leu 

100 105 HO 

He Leu Gin Ala Asn Phe Val His Ser Gin Arg Arg Glu Ser Phe Leu 

115 120 125 

Tyr Arg Ser Asp Ser Asp Tyr Asp Leu Ser Pro Lys Ser Met Ser Arg 

130 135 140 

Asn Ser Ser He Ala Ser Asp He His Gly Asp Asp Leu He Val Thr 
145 150 155 160 

Pro Phe Ala Gin Val Leu Ala Ser Leu Arg Thr Val Arg Asn Asn Phe 

165 170 175 

Ala Ala Leu Thr Asn Leu Gin Asp Arg Ala Pro Ser Lys Arg Ser Pro 

180 185 190 

Met Cys Asn Gin Pro Ser He Asn Lys Ala Thr He Thr Glu Glu Ala 

195 200 205 

Tyr Gin Lys Leu Ala Ser Glu Thr Leu Glu Glu Leu Asp Trp Cys Leu 

210 215 220 

Asp Gin Leu Glu Thr Leu Gin Thr Arg His Ser Val Ser Glu Met Ala 
225 230 235 240 

Ser Asn Lys Phe Lys Arg Met Leu Asn Arg Glu Leu Thr His Leu Ser 

245 250 255 

Glu Met Ser Arg Ser Gly Asn Gin Val Ser Glu Phe He Ser Asn Thr 

260 265 270 

Phe Leu Asp Lys Gin His Glu Val Glu He Pro Ser Pro Thr Gin Lys 

275 280 285 

Glu Lys Glu Lys Lys Lys Arg Pro Met Ser Gin He Ser Gly Val Lys 

290 295 300 

Lys Leu Met His Ser Ser Ser Leu Thr Asn Ser Ser He Pro Arg Phe 
305 310 315 320 

Gly Val Lys Thr Glu Gin Glu Asp Val Leu Ala Lys Glu Leu Glu Asp 

325 330 335 

Val Asn Lys Trp Gly Leu His Val Phe Arg He Ala Glu Leu Ser Gly 

340 345 350 

Asn Arg Pro Leu Thr Val He Met His Thr He Phe Gin Glu Arg Asp 

355 360 365 

Leu Leu Lys Thr Phe Lys He Pro Val Asp Thr Leu He Thr Tyr Leu 

370 375 380 

Met Thr Leu Glu Asp His Tyr His Ala Asp Val Ala Tyr His Asn Asn 
385 390 395 400 

He His Ala Ala Asp Val Val Gin Ser Thr His Val Leu Leu Ser Thr 
405 410 415 
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Pro Ala Leu Glu Ala Val Phe Thr Asp Leu Glu lie Leu Ala Ala He 

420 425 430 

Phe Ala Ser Ala He His Asp Val Asp His Pro Gly Val Ser Asn Gin 

435 440 445 

Phe Leu He Asn Thr Asn Ser Glu Leu Ala Leu Met Tyr Asn Asp Ser 

450 455 460 

Ser Val Leu Glu Asn His His Leu Ala Val Gly Phe Lys Leu Leu Gin 
465 470 475 480 

Glu Glu Asn Cys Asp He Phe Gin Asn Leu Thr Lys Lys Gin Arg Gin 

485 490 495 

Ser Leu Arg Lys Met Val He Asp He Val Leu Ala Thr Asp Met Ser 

500 505 510 

Lys His Met Asn Leu Leu Ala Asp Leu Lys Thr Met Val Glu Thr Lys 

515 520 525 

Lys Val Thr Ser Ser Gly Val Leu Leu Leu Asp Asn Tyr Ser Asp Arg 

530 535 540 

He Gin Val Leu Gin Asn Met Val His Cys Ala Asp Leu Ser Asn Pro 
545 550 555 560 

Thr Lys Pro Leu Gin Leu Tyr Arg Gin Trp Thr Asp Arg He Met Glu 

565 570 575 

Glu Phe Phe Arg Gin Gly Asp Arg Glu Arg Glu Arg Gly Met Glu He 

580 585 590 

Ser Pro Met Cys Asp Lys His Asn Ala Ser Val Glu Lys Ser Gin Val 

595 600 605 

Gly Phe He Asp Tyr He Val His Pro Leu Trp Glu Thr Trp Ala Asp 

610 615 620 

Leu Val His Pro Asp Ala Gin Asp He Leu Asp Thr Leu Glu Asp Asn 
625 630 635 640 

Arg Glu Trp Tyr Gin Ser Thr He Pro Gin Ser Pro Ser Pro Ala Pro 

645 650 655 

Asp Asp Pro Glu Glu Gly Arg Gin Gly Gin Thr Glu Lys Phe Gin Phe 

660 665 670 

Glu Leu Thr Leu Glu Glu Asp Gly Glu Ser Asp Thr Glu Lys Asp Ser 

675 680 685 

Gly Ser Gin Val Glu Glu Asp Thr Ser Cys Ser Asp Ser Lys Thr Leu 

690 695 700 

Cys Thr Gin Asp Ser Glu Ser Thr Glu He Pro Leu Asp Glu Gin Val 
705 710 715 720 

Glu Glu Glu Ala Val Gly Glu Glu Glu Glu Ser Gin Pro Glu Ala Cys 

725 730 735 

Val He Asp Asp Arg Ser Pro Asp Thr Thr Gly He Leu Gin Ser Thr 

740 745 750 

Val Pro Arg Ala Arg Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly 

755 760 765 

Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu Leu Asp Gly 

770 775 780 

Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp 
785 790 795 800 

Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys Thr Thr Gly Lys 

805 810 815 

Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val 

820 825 830 

Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe 
835 840 845 



Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe 

850 855 860 

Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly 
865 870 875 880 

Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He Asp Phe Lys Glu 

885 890 895 

Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His 

900 905 910 

Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn 

915 920 925 

Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp 

930 935 940 

His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro 
945 950 955 960 

Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn 

965 970 975 

Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly 

980 985 990 

He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
995 1000 

(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3201 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME /KEY: Coding Sequence 

(B) LOCATION: 1...3198 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

ATG GAG GCA GAG GGC AGC AGC GCG CCG GCC CGG GCG GGC AGC GGA GAG 48 
Met Glu Ala Glu Gly Ser Ser Ala Pro Ala Arg Ala Gly Ser Gly Glu 
15 10 15 

GGC AGC GAC AGC GCC GGC GGG GCC ACG CTC AAA GCC CCC AAG CAT CTC 96 
Gly Ser Asp Ser Ala Gly Gly Ala Thr Leu Lys Ala Pro Lys His Leu 
20 25 30 

TGG AGG CAC GAG CAG CAC CAC CAG TAC CCG CTC CGG CAG CCC CAG TTC 144 
Trp Arg His Glu Gin His His Gin Tyr Pro Leu Arg Gin Pro Gin Phe 
35 40 45 

CGC CTC CTG CAT CCC CAT CAC CAC CTG CCC CCG CCG CCG CCA CCC TCG 192 
Arg Leu Leu His Pro His His His Leu Pro Pro Pro Pro Pro Pro Ser 
50 55 60 
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CCC CAG CCC CAG CCC CAG TGT CCG CTA CAG CCG CCG CCG CCG CCC CCC 240 
Pro Gin Pro Gin Pro Gin Cys Pro Leu Gin Pro Pro Pro Pro Pro Pro 
65 70 75 80 

CTG CCG CCG CCC CCG CCG CCG CCC GGG GCT GCC CGC GGC CGC TAC GCC 2 88 

Leu Pro Pro Pro Pro Pro Pro Pro Gly Ala Ala Arg Gly Arg Tyr Ala 
85 90 95 

TCG AGC GGG GCC ACC GGC CGC GTC CGG CAT CGC GGC TAC TCG GAC ACC 3 36 

Ser Ser Gly Ala Thr Gly Arg Val Arg His Arg Gly Tyr Ser Asp Thr 
100 105 110 

GAG CGC TAC CTG TAC TGT CGC GCC ATG GAC CGC ACC TCC TAC GCG GTG 3 84 

Glu Arg Tyr Leu Tyr Cys Arg Ala Met Asp Arg Thr Ser Tyr Ala Val 
115 120 125 

GAG ACC GGC CAC CGG CCC GGC CTG AAG AAA TCC AGG ATG TCC TGG CCC 432 
Glu Thr Gly His Arg Pro Gly Leu Lys Lys Ser Arg Met Ser Trp Pro 
130 135 140 

TCC TCG TTC CAG GGA CTC AGG CGT TTT GAT GTG GAC AAT GGC ACA TCT 4 80 

Ser Ser Phe Gin Gly Leu Arg Arg Phe Asp Val Asp Asn Gly Thr Ser 
145 150 155 160 

GCG GGA CGG AGT CCC TTG GAT CCC ATG ACC AGC CCA GGA TCC GGG CTA 528 
Ala Gly Arg Ser Pro Leu Asp Pro Met Thr Ser Pro Gly Ser Gly Leu 
165 170 175 

ATT CTC CAA GCA AAT TTT GTC CAC AGT CAA CGA CGG GAG TCC TTC CTG 576 
lie Leu Gin Ala Asn Phe Val His Ser Gin Arg Arg Glu Ser Phe Leu 
180 185 190 

TAT CGA TCC GAC AGC GAT TAT GAC CTC TCT CCA AAG TCT ATG TCC CGG 624 
Tyr Arg Ser Asp Ser Asp Tyr A.sp Leu Ser Pro Lys Ser Met Ser Arg 
195 200 205 

AAC TCC TCC ATT GCC AGT GAT ATA CAC GGA GAT GAC TTG ATT GTG ACT 672 
Asn Ser Ser lie Ala Ser Asp lie His Gly Asp Asp Leu lie Val Thr 
210 215 220 

CCA TTT GCT CAG GTC TTG GCC AGT CTG CGA ACT GTA CGA AAC AAC TTT 720 
Pro Phe Ala Gin Val Leu Ala Ser Leu Arg Thr Val Arg Asn Asn Phe 
225 230 235 240 

GCT GCA TTA ACT AAT TTG CAA GAT CGA GCA CCT AGC AAA AGA TCA CCC 7 68 

Ala Ala Leu Thr Asn Leu Gin Asp Arg Ala Pro Ser Lys Arg Ser Pro 
245 250 255 

ATG TGC AAC CAA CCA TCC ATC AAC AAA GCC ACC ATA ACA GAG GAG GCC 816 
Met Cys Asn Gin Pro Ser lie Asn Lys Ala Thr He Thr Glu Glu Ala 
260 265 270 

TAC CAG AAA CTG GCC AGC GAG ACC CTG GAG GAG CTG GAC TGG TGT CTG 864 
Tyr Gin Lys Leu Ala Ser Glu Thr Leu Glu Glu Leu Asp Trp Cys Leu 



2£ / 



275 280 285 

GAC CAG CTA GAG ACC CTA CAG ACC AGG CAC TCC GTC AGT GAG ATG GCC 912 
Asp Gin Leu Glu Thr Leu Gin Thr Arg His Ser Val Ser Glu Met Ala 
290 295 300 

TCC AAC AAG TTT AAA AGG ATG CTT AAT CGG GAG CTC ACC CAT CTC TCT 960 
Ser Asn Lys Phe Lys Arg Met Leu Asn Arg Glu Leu Thr His Leu Ser 
305 310 315 320 

GAA ATG AGT CGG TCT GGA AAT CAA GTG TCA GAG TTT ATA TCA AAC ACA 1008 
Glu Met Ser Arg Ser Gly Asn Gin Val Ser Glu Phe lie Ser Asn Thr 
325 330 335 

TTC TTA GAT AAG CAA CAT GAA GTG GAA ATT CCT TCT CCA ACT CAG AAG 1056 
Phe Leu Asp Lys Gin His Glu Val Glu lie Pro Ser Pro Thr Gin Lys 
340 345 350 

GAA AAG GAG AAA AAG AAA AGA CCA ATG TCT CAG ATC AGT GGA GTC AAG 1104 
Glu Lys Glu Lys Lys Lys Arg Pro Met Ser Gin lie Ser Gly Val Lys 
355 360 365 

AAA TTG ATG CAC AGC TCT AGT CTG ACT AAT TCA AGT ATC CCA AGG TTT 1152 
Lys Leu Met His Ser Ser Ser Leu Thr Asn Ser Ser lie Pro Arg Phe 
370 375 380 

GGA GTT AAA ACT GAA CAA GAA GAT GTC CTT GCC AAG GAA CTA GAA GAT 1200 
Gly Val Lys Thr Glu Gin Glu Asp Val Leu Ala Lys Glu Leu Glu Asp 
385 390 395 400 

GTG AAC AAA TGG GGT CTT CAT GTT TTC AGA ATA GCA GAG TTG TCT GGT 1248 
Val Asn Lys Trp Gly Leu His Val Phe Arg lie Ala Glu Leu Ser Gly 
405 410 415 

AAC CGG CCC TTG ACT GTT ATC ATG CAC ACC ATT TTT CAG GAA CGG GAT 1296 
Asn Arg Pro Leu Thr Val lie Met His Thr lie Phe Gin Glu Arg Asp 
420 425 430 

TTA TTA AAA ACA TTT AAA ATT CCA GTA GAT ACT TTA ATT ACA TAT CTT 13 44 

Leu Leu Lys Thr Phe Lys lie Pro Val Asp Thr Leu lie Thr Tyr Leu 
435 440 445 

ATG ACT CTC GAA GAC CAT TAC CAT GCT GAT GTG GCC TAT CAC AAC AAT 13 92 

Met Thr Leu Glu Asp His Tyr His Ala Asp Val Ala Tyr His Asn Asn 
450 455 460 

ATC CAT GCT GCA GAT GTT GTC CAG TCT ACT CAT GTG CTA TTA TCT ACA 1440 
lie His Ala Ala Asp Val Val Gin Ser Thr His Val Leu Leu Ser Thr 
465 470 475 480 

CCT GCT TTG GAG GCT GTG TTT ACA GAT TTG GAG ATT CTT GCA GCA ATT 1488 
Pro Ala Leu Glu Ala Val Phe Thr Asp Leu Glu lie Leu Ala Ala lie 
485 490 495 



TTT GCC AGT GCA ATA CAT GAT GTA GAT CAT CCT GGT GTG TCC AAT CAA 1536 
Phe Ala Ser Ala He His Asp Val Asp His Pro Gly Val Ser Asn Gin 
500 505 510 

TTT CTG ATC AAT ACA AAC TCT GAA CTT GCC TTG ATG TAC AAT GAT TCC 1584 
Phe Leu He Asn Thr Asn Ser Glu Leu Ala Leu Met Tyr Asn Asp Ser 
515 520 525 

TCA GTC TTA GAG AAC CAT CAT TTG GCT GTG GGC TTT AAA TTG CTT CAG 1632 
Ser Val Leu Glu Asn His His Leu Ala Val Gly Phe Lys Leu Leu Gin 
530 535 540 

GAA GAA AAC TGT GAC ATT TTC CAG AAT TTG ACC AAA AAA CAA AGA CAA 1680 
Glu Glu Asn Cys Asp He Phe Gin Asn Leu Thr Lys Lys Gin Arg Gin 
545 550 555 560 

TCT TTA AGG AAA ATG GTC ATT GAC ATC GTA CTT GCA ACA GAT ATG TCA 17 28 
Ser Leu Arg Lys Met Val He Asp He Val Leu Ala Thr Asp Met Ser 
565 570 575 

AAA CAC ATG AAT CTA CTG GCT GAT TTG AAG ACT ATG GTT GAA ACT AAG 1776 
Lys His Met Asn Leu Leu Ala Asp Leu Lys Thr Met Val Glu Thr Lys 
580 585 590 

AAA GTG ACA AGC TCT GGA GTT CTT CTT CTT GAT AAT TAT TCC GAT AGG 1824 
Lys Val Thr Ser Ser Gly Val Leu Leu Leu Asp Asn Tyr Ser Asp Arg 
595 600 605 

ATT CAG GTT CTT CAG AAT ATG GTG CAC TGT GCA GAT CTG AGC AAC CCA 1872 
He Gin Val Leu Gin Asn Met Val His Cys Ala Asp Leu Ser Asn Pro 
610 615 620 

ACA AAG CCT CTC CAG CTG TAC CGC CAG TGG ACG GAC CGG ATA ATG GAG 1920 
Thr Lys Pro Leu Gin Leu Tyr Arg Gin Trp Thr Asp Arg He Met Glu 
625 630 635 640 

GAG TTC TTC CGC CAA GGA GAC CGA GAG AGG GAA CGT GGC ATG GAG ATA 1968 
Glu Phe Phe Arg Gin Gly Asp Arg Glu Arg Glu Arg Gly Met Glu He 
645 650 655 

AGC CCC ATG TGT GAC AAG CAC AAT GCT TCC GTG GAA AAA TCA CAG GTG 2016 
Ser Pro Met Cys Asp Lys His Asn Ala Ser Val Glu Lys Ser Gin Val 
660 665 670 

GGC TTC ATA GAC TAT ATT GTT CAT CCC CTC TGG GAG ACA TGG GCA GAC 2064 
Gly Phe lie Asp Tyr lie Val His Pro Leu Trp Glu Thr Trp Ala Asp 
675 680 685 

CTC GTC CAC CCT GAC GCC CAG GAT ATT TTG GAC ACT TTG GAG GAC AAT 2112 
Leu Val His Pro Asp Ala Gin Asp lie Leu Asp Thr Leu Glu Asp Asn 
690 695 700 

CGT GAA TGG TAC CAG AGC ACA ATC CCT CAG AGC CCC TCT CCT GCA CCT 2160 
Arg Glu Trp Tyr Gin Ser Thr He Pro Gin Ser Pro Ser Pro Ala Pro 
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705 710 715 720 

GAT GAC CCA GAG GAG GGC CGG GAG GGT CAA ACT GAG AAA TTC CAG TTT 2208 
Asp Asp Pro Glu Glu Gly Arg Gin Gly Gin Thx Glu Lys Phe Gin Phe 
725 730 735 

GAA CTA ACT TTA GAG GAA GAT GGT GAG TCA GAC ACG GAA AAG GAC AGT 2256 
Glu Leu Thr Leu Glu Glu Asp Gly Glu Ser Asp Thr Glu Lys Asp Ser 
740 745 750 

GGC AGT CAA GTG GAA GAA GAC ACT AGC TGC AGT GAC TCC AAG ACT CTT 2304 
Gly Ser Gin Val Glu Glu Asp Thr Ser Cys Ser Asp Ser Lys Thr Leu 
755 760 765 

TGT ACT CAA GAC TCA GAG TCT ACT GAA ATT CCC CTT GAT GAA CAG GTT 23 52 
Cys Thr Gin Asp Ser Glu Ser Thr Glu lie Pro Leu Asp Glu Gin Val 
770 775 780 

GAA GAG GAG GCA GTA GGG GAA GAA GAG GAA AGC CAG CCT GAA GCC TGT 2400 
Glu Glu Glu Ala Val Gly Glu Glu Glu Glu Ser Gin Pro Glu Ala Cys 
785 790 795 800 

GTC ATA GAT GAT CGT TCT CCT GAC ACG ACG GGA ATT CTG CAG TCG ACG 2448 
Val lie Asp Asp Arg Ser Pro Asp Thr Thr Gly lie Leu Gin Ser Thr 
805 810 815 

GTA CCG CGG GCC CGG GAT CCA CCG GTC GCC ACC ATG GTG AGC AAG GGC 2496 
Val Pro Arg Ala Arg Asp Fro Pro Val Ala Thr Met Val Ser Lys Gly 
820 825 830 

GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG GTC GAG CTG GAC GGC 2544 
Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val Glu Leu Asp Gly 
835 840 845 

GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC GAG GGC GAG GGC GAT 2592 
Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp 
850 855 860 

GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC TGC ACC ACC GGC AAG 2640 
Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys Thr Thr Gly Lys 
865 870 875 880 

CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC CTG ACC TAC GGC GTG 2 688 
Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val 
885 890 895 

CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG CAG CAC GAC TTC TTC 273 6 

Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe 
900 905 910 

AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG CGC ACC ATC TTC TTC 27 84 

Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe 
915 920 925 



AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG GTG AAG TTC GAG GGC 2832 
Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly 
930 935 940 

GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC ATC GAC TTC AAG GAG 2880 
Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie Asp Phe Lys Glu 
945 950 955 960 

GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC AAC TAC AAC AGC CAC 2 928 
Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His 
965 970 975 

AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC GGC ATC AAG GTG AAC 297 6 
Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn Gly lie Lys Val Asn 
980 985 990 

TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC GTG CAG CTC GCC GAC 3024 
Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp 
995 1000 1005 

CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC CCC GTG CTG CTG CCC 3072 
His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro 
1010 1015 1020 

GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG AGC AAA GAC CCC AAC 3120 
Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn 
1025 1030 1035 1040 

GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC GTG ACC GCC GCC GGG 3168 
Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly 
1045 1050 1055 

ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TAA 3201 
He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
1060 1065 



(2) INFORMATION FOR SEQ ID NO:151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1066 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:151: 

Met Glu Ala Glu Gly Ser Ser Ala Pro Ala Arg Ala Gly Ser Gly Glu 

15 10 15 

Gly Ser Asp Ser Ala Gly Gly Ala Thr Leu Lys Ala Pro Lys His Leu 
20 25 30 
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Trp Arg His Glu 
35 

Arg Leu Leu His 
50 

Pro Gin Pro Gin 
65 

Leu Pro Pro Pro 

Ser Ser Gly Ala 
100 

Glu Arg Tyr Leu 
115 

Glu Thr Gly His 
130 

Ser Ser Phe Gin 
145 

Ala Gly Arg Ser 

lie Leu Gin Ala 
180 

Tyr Arg Ser Asp 
195 

Asn Ser Ser lie 
210 

Pro Phe Ala Gin 
225 

Ala Ala Leu Thr 

Met Cys Asn Gin 

260 

Tyr Gin Lys Leu 
275 

Asp Gin Leu Glu 
290 

Ser Asn Lys Phe 
305 

Glu Met Ser Arg 

Phe Leu Asp Lys 
340 

Glu Lys Glu Lys 
355 

Lys Leu Met His 
370 

Gly Val Lys Thr 
385 

Val Asn Lys Trp 

Asn Arg Fro Leu 
420 

Leu Leu Lys Thr 
435 

Met Thr Leu Glu 
450 



Gin His His Gin 
40 

Pro His His His 
55 

Pro Gin Cys Pro 
70 

Pro Pro Pro Pro 
85 

Thr Gly Arg Val 

Tyr Cys Arg Ala 
120 

Arg Pro Gly Leu 
135 

Gly Leu Arg Arg 
150 

Pro Leu Asp Pro 
165 

Asn Phe Val His 

Ser Asp Tyr Asp 
200 

Ala Ser Asp lie 
215 

Val Leu Ala Ser 
230 

Asn Leu Gin Asp 
245 

Pro Ser lie Asn 

Ala Ser Glu Thr 
280 

Thr Leu Gin Thr 
295 

Lys Arg Met Leu 
310 

Ser Gly Asn Gin 
325 

Gin His Glu Val 

Lys Lys Arg Pro 
360 

Ser Ser Ser Leu 
375 

Glu Gin Glu Asp 
390 

Gly Leu His Val 
405 

Thr Val lie Met 

Phe Lys lie Pro 
440 

Asp His Tyr His 
455 



Tyr Pro Leu Arg 

Leu Pro Pro Pro 
60 

Leu Gin Pro Pro 
75 

Gly Ala Ala Arg 
90 

Arg His Arg Gly 
105 

Met Asp Arg Thr 

Lys Lys Ser Arg 
140 

Phe Asp Val Asp 
155 

Met Thr Ser Pro 
170 

Ser Gin Arg Arg 
185 

Leu Ser Pro Lys 

His Gly Asp Asp 
220 

Leu Arg Thr Val 

235 

Arg Ala Pro Ser 
250 

Lys Ala Thr He 
265 

Leu Glu Glu Leu 

Arg His Ser Val 
300 

Asn Arg Glu Leu 
315 

Val Ser Glu Phe 
330 

Glu He Pro Ser 
345 

Met Ser Gin He 

Thr Asn Ser Ser 
380 

Val Leu Ala Lys 
395 

Phe Arg He Ala 
410 

His Thr He Phe 
425 

Val Asp Thr Leu 

Ala Asp Val Ala 
460 



Gin Pro Gin Phe 
45 

Pro Pro Pro Ser 

Pro Pro Pro Pro 
80 

Gly Arg Tyr Ala 
95 

Tyr Ser Asp Thr 
110 

Ser Tyr Ala Val 
125 

Met Ser Trp Pro 

Asn Gly Thr Ser 
160 

Gly Ser Gly Leu 
175 

Glu Ser Phe Leu 
190 

Ser Met Ser Arg 
205 

Leu He Val Thr 

Arg Asn Asn Phe 
240 

Lys Arg Ser Pro 
255 

Thr Glu Glu Ala 
270 

Asp Trp Cys Leu 
285 

Ser Glu Met Ala 

Thr His Leu Ser 
320 

He Ser Asn Thr 
335 

Pro Thr Gin Lys 
350 

Ser Gly Val Lys 
365 

He Pro Arg Phe 

Glu Leu Glu Asp 
400 

Glu Leu Ser Gly 
415 

Gin Glu Arg Asp 
430 

He Thr Tyr Leu 
445 

Tyr His Asn Asn 
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lie His Ala Ala Asp Val Val Gin Ser Thr His Val Leu Leu Ser Thr 
465 470 475 480 

Pro Ala Leu Glu Ala Val Phe Thr Asp Leu Glu lie Leu Ala Ala lie 

485 490 495 

Phe Ala Ser Ala lie His Asp Val Asp His Pro Gly Val Ser Asn Gin 

500 505 510 

Phe Leu lie Asn Thr Asn Ser Glu Leu Ala Leu Met Tyr Asn Asp Ser 

515 520 525 

Ser Val Leu Glu Asn His His Leu Ala Val Gly Phe Lys Leu Leu Gin 

530 535 540 

Glu Glu Asn Cys Asp lie Phe Gin Asn Leu Thr Lys Lys Gin Arg Gin 
545 550 555 560 

Ser Leu Arg Lys Met Val lie Asp lie Val Leu Ala Thr Asp Met Ser 

565 570 575 

Lys His Met Asn Leu Leu Ala Asp Leu Lys Thr Met Val Glu Thr Lys 

580 585 590 

Lys Val Thr Ser Ser Gly Val Leu Leu Leu Asp Asn Tyr Ser Asp Arg 

595 600 605 

lie Gin Val Leu Gin Asn Met Val His Cys Ala Asp Leu Ser Asn Pro 

610 615 620 

Thr Lys Pro Leu Gin Leu Tyr Arg Gin Trp Thr Asp Arg lie Met Glu 
625 630 635 640 

Glu Phe Phe Arg Gin Gly Asp Arg Glu Arg Glu Arg Gly Met Glu lie 

645 650 655 

Ser Pro Met Cys Asp Lys His Asn Ala Ser Val Glu Lys Ser Gin Val 

660 665 670 

Gly Phe lie Asp Tyr lie Val His Pro Leu Trp Glu Thr Trp Ala Asp 

675 680 685 

Leu Val His Pro Asp Ala Gin Asp lie Leu Asp Thr Leu Glu Asp Asn 

690 695 700 

Arg Glu Trp Tyr Gin Ser Thr He Pro Gin Ser Pro Ser Pro Ala Pro 
705 710 715 720 

Asp Asp Pro Glu Glu Gly Arg Gin Gly Gin Thr Glu Lys Phe Gin Phe 

725 730 735 

Glu Leu Thr Leu Glu Glu Asp Gly Glu Ser Asp Thr Glu Lys Asp Ser 

740 745 750 

Gly Ser Gin Val Glu Glu Asp Thr Ser Cys Ser Asp Ser Lys Thr Leu 

755 760 765 

Cys Thr Gin Asp Ser Glu Ser Thr Glu He Pro Leu Asp Glu Gin Val 

770 775 780 

Glu Glu Glu Ala Val Gly Glu Glu Glu Glu Ser Gin Pro Glu Ala Cys 
785 790 795 800 

Val He Asp Asp Arg Ser Pro Asp Thr Thr Gly He Leu Gin Ser Thr 

805 810 815 

Val Pro Arg Ala Arg Asp Pro Pro Val Ala Thr Met Val Ser Lys Gly 

820 825 830 

Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val Glu Leu Asp Gly 

835 840 845 

Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp 

850 855 860 

Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys Thr Thr Gly Lys 
865 870 875 880 

Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val 
885 890 895 



Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe 

900 905 910 

Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr lie Phe Phe 

915 920 925 

Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly 

930 935 940 

Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie Asp Phe Lys Glu 
945 950 955 960 

Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His 

965 970 975 

Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly He Lys Val Asn 

980 985 990 

Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp 

995 1000 1005 

His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro 

1010 1015 1020 

Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn 
025 1030 1035 1040 

Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly 

1045 1050 1055 

He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
1060 1065 



(2) INFORMATION FOR SEQ ID NO: 152: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3024 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...3021 
(D) OTHER INFORMATION: 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 



ATG AGC TGG TCA CCT TCC CTG ACA ACG CAG ACA TGT GGG GCC TGG GAA 48 

Met Ser Trp Ser Pro Ser Leu Thr Thr Gin Thr Cys Gly Ala Trp Glu 

15 10 15 

ATG AAA GAG CGC CTT GGG ACA GGG GGA TTT GGA AAT GTC ATC CGA TGG 96 

Met Lys Glu Arg Leu Gly Thr Gly Gly Phe Gly Asn Val He Arg Trp 
20 25 30 

CAC AAT CAG GAA ACA GGT GAG CAG ATT GCC ATC AAG CAG TGC CGG CAG 144 

His Asn Gin Glu Thr Gly Glu Gin He Ala He Lys Gin Cys Arg Gin 
35 40 45 

GAG CTC AGC CCC CGG AAC CGA GAG CGG TGG TGC CTG GAG ATC CAG ATC 192 

Glu Leu Ser Pro Arg Asn Arg Glu Arg Trp Cys Leu Glu He Gin He 



50 55 60 

ATG AGA AGG CTG ACC CAC CCC AAT GTG GTG GCT GCC CGA GAT GTC CCT 240 
Met Arg Arg Leu Thr His Pro Asn Val Val Ala Ala Arg Asp Val Pro 
65 70 75 80 

GAG GGG ATG CAG AAC TTG GCG CCC AAT GAC CTG CCC CTG CTG GCC ATG 2 88 

Glu Gly Met Gin Asn Leu Ala Pro Asn Asp Leu Pro Leu Leu Ala Met 
85 90 95 

GAG TAC TGC CAA GGA GGA GAT CTC CGG AAG TAC CTG AAC CAG TTT GAG 336 
Glu Tyr Cys Gin Gly Gly Asp Leu Arg Lys Tyr Leu Asn Gin Phe Glu 
100 105 110 

AAC TGC TGT GGT CTG CGG GAA GGT GCC ATC CTC ACC TTG CTG AGT GAC 384 
Asn Cys Cys Gly Leu Arg Glu Gly Ala lie Leu Thr Leu Leu Ser Asp 
115 120 125 

ATT GCC TCT GCG CTT AGA TAC CTT CAT GAA AAC AGA ATC ATC CAT CGG 432 
He Ala Ser Ala Leu Arg Tyr Leu His Glu Asn Arg He He His Arg 
130 135 140 

GAT CTA AAG CCA GAA AAC ATC GTC CTG CAG CAA GGA GAA CAG AGG TTA 480 
Asp Leu Lys Pro Glu Asn He Val Leu Gin Gin Gly Glu Gin Arg Leu 
145 150 155 160 

ATA CAC AAA ATT ATT GAC CTA GGA TAT GCC AAG GAG CTG GAT CAG GGC 528 
He His Lys He He Asp Leu Gly Tyr Ala Lys Glu Leu Asp Gin Gly 
165 170 175 

AGT CTT TGC ACA TCA TTC GTG GGG ACC CTG CAG TAC CTG GCC CCA GAG 576 
Ser Leu Cys Thr Ser Phe Val Gly Thr Leu Gin Tyr Leu Ala Pro Glu 
180 185 190 

CTA CTG GAG CAG CAG AAG TAC ACA GTG ACC GTC GAC TAC TGG AGC TTC 624 
Leu Leu Glu Gin Gin Lys Tyr Thr Val Thr Val Asp Tyr Trp Ser Phe 
195 200 205 

GGC ACC CTG GCC TTT GAG TGC ATC ACG GGC TTC CGG CCC TTC CTC CCC 672 
Gly Thr Leu Ala Phe Glu Cys He Thr Gly Phe Arg Pro Phe Leu Pro 
210 215 220 

AAC TGG CAG CCC GTG CAG TGG CAT TCA AAA GTG CGG CAG AAG AGT GAG 720 
Asn Trp Gin Pro Val Gin Trp His Ser Lys Val Arg Gin Lys Ser Glu 
225 230 235 240 

GTG GAC ATT GTT GTT AGC GAA GAC TTG AAT GGA ACG GTG AAG TTT TCA 7 68 

Val Asp He Val Val Ser Glu Asp Leu Asn Gly Thr Val Lys Phe Ser 
245 250 255 

AGC TCT TTA CCC TAC CCC AAT AAT CTT AAC AGT GTC CTG GCT GAG CGA 816 
Ser Ser Leu Pro Tyr Pro Asn Asn Leu Asn Ser Val Leu Ala Glu Arg 
260 265 270 
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CTG GAG AAG TGG CTG CAA CTG ATG CTG ATG TGG CAC CCC CGA CAG AGG 864 
Leu Glu Lys Trp Leu Gin Leu Met Leu Met Trp His Pro Arg Gin Arg 
275 280 285 

GGC ACG GAT CCC ACG TAT GGG CCC AAT GGC TGC TTC AAG GCC CTG GAT 912 
Gly Thr Asp Pro Thr Tyr Gly Pro Asn Gly Cys Phe Lys Ala Leu Asp 
290 295 300 

GAC ATC TTA AAC TTA AAG CTG GTT CAT ATC TTG AAC ATG GTC ACG GGC 960 
Asp lie Leu Asn Leu Lys Leu Val His lie Leu Asn Met Val Thr Gly 
305 310 315 320 

ACC ATC CAC ACC TAC CCT GTG ACA GAG GAT GAG AGT CTG CAG AGC TTG 1008 
Thr lie His Thr Tyr Pro Val Thr Glu Asp Glu Ser Leu Gin Ser Leu 
325 330 335 

AAG GCC AGA ATC CAA CAG GAC ACG GGC ATC CCA GAG GAG GAC CAG GAG 1056 
Lys Ala Arg lie Gin Gin Asp Thr Gly lie Pro Glu Glu Asp Gin Glu 
340 345 350 

CTG CTG CAG GAA GCG GGC CTG GCG TTG ATC CCC GAT AAG CCT GCC ACT 1104 
Leu Leu Gin Glu Ala Gly Leu Ala Leu He Pro Asp Lys Pro Ala Thr 
355 360 365 

CAG TGT ATT TCA GAC GGC AAG TTA AAT GAG GGC CAC ACA TTG GAC ATG 1152 
Gin Cys He Ser Asp Gly Lys Leu Asn Glu Gly His Thr Leu Asp Met 
370 375 380 

GAT CTT GTT TTT CTC TTT GAC AAC AGT AAA ATC ACC TAT GAG ACT CAG 12 00 

Asp Leu Val Phe Leu Phe Asp Asn Ser Lys He Thr Tyr Glu Thr Gin 
3S5 390 395 400 

ATC TCC CCA CGG CCC CAA CCT GAA AGT GTC AGC TGT ATC CTT CAA GAG 12 48 
He Ser Pro Arg Pro Gin Pro Glu Ser Val Ser Cys lie Leu Gin Glu 
405 410 415 

CCC AAG AGG AAT CTC GCC TTC TTC CAG CTG AGG AAG GTG TGG GGC CAG 12 96 
Pro Lys Arg Asn Leu Ala Phe Phe Gin Leu Arg Lys Val Trp Gly Gin 
420 425 430 

GTC TGG CAC AGC ATC CAG ACC CTG AAG GAA GAT TGC AAC CGG CTG CAG 13 44 
Val Trp His Ser He Gin Thr Leu Lys Glu Asp Cys Asn Arg Leu Gin 
435 440 445 

CAG GGA CAG CGA GCC GCC ATG ATG AAT CTC CTC CGA AAC AAC AGC TGC 1392 
Gin Gly Gin Arg Ala Ala Met Met Asn Leu Leu Arg Asn Asn Ser Cys 
450 455 460 

CTC TCC AAA ATG AAG AAT TCC ATG GCT TCC ATG TCT CAG CAG CTC AAG 1440 
Leu Ser Lys Met Lys Asn Ser Met Ala Ser Met Ser Gin Gin Leu Lys 
465 470 475 480 

GCC AAG TTG GAT TTC TTC AAA ACC AGC ATC CAG ATT GAC CTG GAG AAG 14 88 
Ala Lys Leu Asp Phe Phe Lys Thr Ser He Gin lie Asp Leu Glu Lys 
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485 490 495 

TAC AGC GAG CAA ACC GAG TTT GGG ATC ACA TCA GAT AAA CTG CTG CTG 1536 
Tyr Ser Glu Gin Thr Glu Phe Gly lie Thr Ser Asp Lys Leu Leu Leu 
500 505 510 

GCC TGG AGG GAA ATG GAG CAG GCT GTG GAG CTC TGT GGG CGG GAG AAC 1584 
Ala Trp Arg Glu Met Glu Gin Ala Val Glu Leu Cys Gly Arg Glu Asn 

515 520 525 

GAA GTG AAA CTC CTG GTA GAA CGG ATG ATG GCT CTG CAG ACC GAC ATT 1632 
Glu Val Lys Leu Leu Val Glu Arg Met Met Ala Leu Gin Thr Asp He 
530 535 540 

GTG GAC TTA CAG AGG AGC CCC ATG GGC CGG AAG CAG GGG GGA ACG CTG 1680 
Val Asp Leu Gin Arg Ser Pro Met Gly Arg Lys Gin Gly Gly Thr Leu 
545 550 555 560 

GAC GAC CTA GAG GAG CAA GCA AGG GAG CTG TAC AGG AGA CTA AGG GAA 172 8 
Asp Asp Leu Glu Glu Gin Ala Arg Glu Leu Tyr Arg Arg Leu Arg Glu 
565 570 575 

AAA CCT CGA GAC CAG CGA ACT GAG GGT GAC AGT CAG GAA ATG GTA CGG 177 6 

Lys Pro Arg Asp Gin Arg Thr Glu Gly Asp Ser Gin Glu Met Val Arg 
580 585 590 

CTG CTG CTT CAG GCA ATT CAG AGC TTC GAG AAG AAA GTG CGA GTG ATC 182 4 

Leu Leu Leu Gin Ala He Gin Ser Phe Glu Lys Lys Val Arg Val He 
595 600 605 

TAT ACG CAG CTC AGT AAA ACT GTG GTT TGC AAG CAG AAG GCG CTG GAA 1872 
Tyr Thr Gin Leu Ser Lys Thr Val Val Cys Lys Gin Lys Ala Leu Glu 
610 615 620 

CTG TTG CCC AAG GTG GAA GAG GTG GTG AGC TTA ATG AAT GAG GAT GAG 1920 
Leu Leu Pro Lys Val Glu Glu Val Val Ser Leu Met Asn Glu Asp Glu 
625 630 635 640 

AAG ACT GTT GTC CGG CTG CAG GAG AAG CGG CAG AAG GAG CTC TGG AAT 1968 
Lys Thr Val Val Arg Leu Gin Glu Lys Arg Gin Lys Glu Leu Trp Asn 
645 650 655 

CTC CTG AAG ATT GCT TGT AGC AAG GTC CGT GGT CCT GTC AGT GGA AGC 2016 
Leu Leu Lys He Ala Cys Ser Lys Val Arg Gly Pro Val Ser Gly Ser 
660 665 670 

CCG GAT AGC ATG AAT GCC TCT CGA CTT AGC CAG CCT GGG CAG CTG ATG 2064 
Pro Asp Ser Met Asn Ala Ser Arg Leu Ser Gin Pro Gly Gin Leu Met 
675 680 685 

TCT CAG CCC TCC ACG GCC TCC AAC AGC TTA CCT GAG CCA GCC AAG AAG 2112 
Ser Gin Pro Ser Thr Ala Ser Asn Ser Leu Pro Glu Pro Ala Lys Lys 
690 695 700 
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AGT GAA GAA CTG GTG GCT GAA GCA CAT AAC CTC TGC ACC CTG CTA GAA 2160 
Ser Glu Glu Leu Val Ala Glu Ala His Asn Leu Cys Thr Leu Leu Glu 
705 710 715 720 

AAT GCC ATA CAG GAC ACT GTG AGG GAA CAA GAC CAG AGT TTC ACG GCC 22 08 
Asn Ala lie Gin Asp Thr Val Arg Glu Gin Asp Gin Ser Phe Thr Ala 
725 730 735 

CTA GAC TGG AGC TGG TTA CAG ACG GAA GAA GAA GAG CAC AGC TGC CTG 22 56 
Leu Asp Trp Ser Trp Leu Gin Thr Glu Glu Glu Glu His Ser Cys Leu 
740 745 750 

GAG CAG GCC TCA TGG GTA CCG CGG GCC CGG GAT CCA CCG GTC GCC ACC 2304 
Glu Gin Ala Ser Trp Val Pro Arg Ala Arg Asp Pro Pro Val Ala Thr 
755 760 765 

ATG GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 2352 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
770 775 780 

GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 2400 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
785 790 795 800 

GAG GGC GAG GGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 2448 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
805 810 815 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 24 96 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

S20 825 830 

CTG ACC TAC GGC GTG CAG TGC TTC AGC CGC TAC CCC GAC CAC ATG AAG 2 544 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
835 840 845 

CAG CAC GAC TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 2592 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
850 855 860 

CGC ACC ATC TTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 2 640 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
865 870 875 880 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 2 6 88 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
885 890 895 

ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 27 3 6 
lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
900 905 910 

AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 27 84 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
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915 920 925 

GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 2 832 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
930 935 940 

GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC GGC GAC GGC 2 880 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
945 950 955 960 

CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG TCC GCC CTG 292 8 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

965 970 975 

AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG CTG GAG TTC 297 6 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
980 985 990 

GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG TAC AAG TAA 3 024 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
995 1000 1005 



(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1007 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE : internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 

Met Ser Trp Ser Pro Ser Leu Thr Thr Gin Thr Cys Gly Ala Trp Glu 

15 10 15 

Met Lys Glu Arg Leu Gly Thr Gly Gly Phe Gly Asn Val He Arg Trp 

20 25 30 

His Asn Gin Glu Thr Gly Glu Gin He Ala lie Lys Gin Cys Arg Gin 

35 40 45 

Glu Leu Ser Pro Arg Asn Arg Glu Arg Trp Cys Leu Glu He Gin He 

50 55 60 

Met Arg Arg Leu Thr His Pro Asn Val Val Ala Ala Arg Asp Val Pro 
65 70 75 80 

Glu Gly Met Gin Asn Leu Ala Pro Asn Asp Leu Pro Leu Leu Ala Met 

85 90 95 

Glu Tyr Cys Gin Gly Gly Asp Leu Arg Lys Tyr Leu Asn Gin Phe Glu 

100 105 110 

Asn Cys Cys Gly Leu Arg Glu Gly Ala He Leu Thr Leu Leu Ser Asp 

115 120 125 

He Ala Ser Ala Leu Arg Tyr Leu His Glu Asn Arg He He His Arg 
130 135 140 
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Asp Leu Lys Pro Glu Asn He Val Leu Gin Gin Gly Glu Gin Arg Leu 
145 150 155 160 

He His Lys He He Asp Leu Gly Tyr Ala Lys Glu Leu Asp Gin Gly 

165 170 175 

Ser Leu Cys Thr Ser Phe Val Gly Thr Leu Gin Tyr Leu Ala Pro Glu 

180 185 190 

Leu Leu Glu Gin Gin Lys Tyr Thr Val Thr Val Asp Tyr Trp Ser Phe 

195 200 205 

Gly Thr Leu Ala Phe Glu Cys He Thr Gly Phe Arg Pro Phe Leu Pro 

210 215 220 

Asn Trp Gin Pro Val Gin Trp His Ser Lys Val Arg Gin Lys Ser Glu 
225 230 235 240 

Val Asp He Val Val Ser Glu Asp Leu Asn Gly Thr Val Lys Phe Ser 

245 250 255 

Ser Ser Leu Pro Tyr Pro Asn Asn Leu Asn Ser Val Leu Ala Glu Arg 

260 265 270 

Leu Glu Lys Trp Leu Gin Leu Met Leu Met Trp His Pro Arg Gin Arg 

275 280 285 

Gly Thr Asp Pro Thr Tyr Gly Pro Asn Gly Cys Phe Lys Ala Leu Asp 

290 295 300 

Asp lie Leu Asn Leu Lys Leu Val His He Leu Asn Met Val Thr Gly 
305 310 315 320 

Thr He His Thr Tyr Pro Val Thr Glu Asp Glu Ser Leu Gin Ser Leu 

325 330 335 

Lys Ala Arg He Gin Gin Asp Thr Gly He Pro Glu Glu Asp Gin Glu 

340 345 350 

Leu Leu Gin Glu Ala Gly Leu Ala Leu He Pro Asp Lys Pro Ala Thr 

355 360 365 

Gin Cys lie Ser Asp Gly Lys Leu Asn Glu Gly His Thr Leu Asp Met 

370 375 380 

Asp Leu Val Phe Leu Phe Asp Asn Ser Lys He Thr Tyr Glu Thr Gin 
385 390 395 400 

lie Ser Pro Arg Pro Gin Pro Glu Ser Val Ser Cys He Leu Gin Glu 

405 410 415 

Pro Lys Arg Asn Leu Ala Phe Phe Gin Leu Arg Lys Val Trp Gly Gin 

420 425 430 

Val Trp His Ser He Gin Thr Leu Lys Glu Asp Cys Asn Arg Leu Gin 

435 440 445 

Gin Gly Gin Arg Ala Ala Met Met Asn Leu Leu Arg Asn Asn Ser Cys 

450 455 460 

Leu Ser Lys Met Lys Asn Ser Met Ala Ser Met Ser Gin Gin Leu Lys 
465 470 475 480 

Ala Lys Leu Asp Phe Phe Lys Thr Ser He Gin lie Asp Leu Glu Lys 

485 490 495 

Tyr Ser Glu Gin Thr Glu Phe Gly lie Thr Ser Asp Lys Leu Leu Leu 

500 505 510 

Ala Trp Arg Glu Met Glu Gin Ala Val Glu Leu Cys Gly Arg Glu Asn 

515 520 525 

Glu Val Lys Leu Leu Val Glu Arg Met Met Ala Leu Gin Thr Asp He 

530 535 540 

Val Asp Leu Gin Arg Ser Pro Met Gly Arg Lys Gin Gly Gly Thr Leu 
545 550 555 560 

Asp Asp Leu Glu Glu Gin Ala Arg Glu Leu Tyr Arg Arg Leu Arg Glu 
565 570 575 
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Lys Pro Arg Asp Gin Arg Thr Glu Gly Asp Ser Gin Glu Met Val Arg 

580 585 590 

Leu Leu Leu Gin Ala lie Gin Ser Phe Glu Lys Lys Val Arg Val lie 

595 600 605 

Tyr Thr Gin Leu Ser Lys Thr Val Val Cys Lys Gin Lys Ala Leu Glu 

610 615 620 

Leu Leu Pro Lys Val Glu Glu Val Val Ser Leu Met Asn Glu Asp Glu 
625 630 635 640 

Lys Thr Val Val Arg Leu Gin Glu Lys Arg Gin Lys Glu Leu Trp Asn 

645 650 655 

Leu Leu Lys lie Ala Cys Ser Lys Val Arg Gly Pro Val Ser Gly Ser 

660 665 670 

Pro Asp Ser Met Asn Ala Ser Arg Leu Ser Gin Pro Gly Gin Leu Met 

675 680 685 

Ser Gin Pro Ser Thr Ala Ser Asn Ser Leu Pro Glu Pro Ala Lys Lys 

690 695 700 

Ser Glu Glu Leu Val Ala Glu Ala His Asn Leu Cys Thr Leu Leu Glu 
705 710 715 720 

Asn Ala He Gin Asp Thr Val Arg Glu Gin Asp Gin Ser Phe Thr Ala 

725 730 735 

Leu Asp Trp Ser Trp Leu Gin Thr Glu Glu Glu Glu His Ser Cys Leu 

740 745 750 

Glu Gin Ala Ser Trp Val Pro Arg Ala Arg Asp Pro Pro Val Ala Thr 

755 760 765 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

770 775 780 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
785 790 795 800 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

805 810 815 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

820 825 830 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

835 840 845 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

850 855 860 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
865 870 875 880 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

885 890 895 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

900 905 910 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 

915 920 925 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

930 935 940 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
945 950 955 960 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

965 970 975 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

980 985 990 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 



995 



1000 



1005 
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(2) INFORMATION FOR SEQ ID NO:154: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2793 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: CDNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. . .2790 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

ATG ATG CAC GTG AAT AAT TTT CCC TTT AGA AGG CAT TCC TGG ATA TGT 48 
Met Met His Val Asn Asn Phe Pro Phe Arg Arg His Ser Trp lie Cys 
15 10 15 

TTT GAT GTG GAC AAT GGC ACA TCT GCG GGA CGG AGT CCC TTG GAT CCC 96 
Phe Asp Val Asp Asn Gly Thr Ser Ala Gly Arg Ser Pro Leu Asp Pro 
20 25 30 

ATG ACC AGO CCA GGA TCC GGG CTA ATT CTC CAA GCA AAT TTT GTC CAC 144 
Met Thr Ser Pro Gly Ser Gly Leu lie Leu Gin Ala Asn Phe Val His 
35 40 45 

AGT CAA CGA CGG GAG TCC TTC CTG TAT CGA TCC GAC AGC GAT TAT GAC 192 
Ser Gin Arg Arg Glu Ser Phe Leu Tyr Arg Ser Asp Ser Asp Tyr Asp 
50 55 60 

CTC TCT CCA AAG TCT ATG TCC CGG AAC TCC TCC ATT GCC AGT GAT ATA 240 
Leu Ser Pro Lys Ser Met Ser Arg Asn Ser Ser lie Ala Ser Asp lie 
65 70 75 80 

CAC GGA GAT GAC TTG ATT GTG ACT CCA TTT GCT CAG GTC TTG GCC AGT 2 88 

His Gly Asp Asp Leu lie Val Thr Pro Phe Ala Gin Val Leu Ala Ser 
85 90 95 

CTG CGA ACT GTA CGA AAC AAC TTT GCT GCA TTA ACT AAT TTG CAA GAT 3 36 

Leu Arg Thr Val Arg Asn Asn Phe Ala Ala Leu Thr Asn Leu Gin Asp 
100 105 HO 

CGA GCA CCT AGC AAA AGA TCA CCC ATG TGC AAC CAA CCA TCC ATC AAC 384 
Arg Ala Pro Ser Lys Arg Ser Pro Met Cys Asn Gin Pro Ser lie Asn 
115 120 125 

AAA GCC ACC ATA ACA GAG GAG GCC TAC CAG AAA CTG GCC AGC GAG ACC 432 
Lys Ala Thr lie Thr Glu Glu Ala Tyr Gin Lys Leu Ala Ser Glu Thr 
130 135 140 



CTG GAG GAG CTG GAC TGG TGT CTG GAC CAG CTA GAG ACC CTA CAG ACC 4 80 

Leu Glu Glu Leu Asp Trp Cys Leu Asp Gin Leu Glu Thr Leu Gin Thr 
145 150 155 160 

AGG CAC TCC GTC AGT GAG ATG GCC TCC AAC AAG TTT AAA AGG ATG CTT 528 
Arg His Ser Val Ser Glu Met Ala Ser Asn Lys Phe Lys Arg Met Leu 
165 170 175 

AAT CGG GAG CTC ACC CAT CTC TCT GAA ATG AGT CGG TCT GGA AAT CAA 5 "7 6 

Asn Arg Glu Leu Thr His Leu Ser Glu Met Ser Arg Ser Gly Asn Gin 
180 185 190 

GTG TCA GAG TTT ATA TCA AAC AC A TTC TTA GAT AAG CAA CAT GAA GTG 624 
Val Ser Glu Phe lie Ser Asn Thr Phe Leu Asp Lys Gin His Glu Val 
195 200 205 

GAA ATT CCT TCT CCA ACT CAG AAG GAA AAG GAG AAA AAG AAA AGA CCA 672 
Glu lie Pro Ser Pro Thr Gin Lys Glu Lys Glu Lys Lys Lys Arg Pro 
210 215 220 

ATG TCT CAG ATC AGT GGA GTC AAG AAA TTG ATG CAC AGC TCT AGT CTG 720 
Met Ser Gin lie Ser Gly Val Lys Lys Leu Met His Ser Ser Ser Leu 
225 230 235 240 

ACT AAT TCA AGT ATC CCA AGG TTT GGA GTT AAA ACT GAA CAA GAA GAT 768 
Thr Asn Ser Ser lie Pro Arg Phe Gly Val Lys Thr Glu Gin Glu Asp 
245 250 255 

GTC CTT GCC AAG GAA CTA GAA GAT GTG AAC AAA TGG GGT CTT CAT GTT 816 
Val Leu Ala Lys Glu Leu Glu Asp Val Asn Lys Trp Gly Leu His Val 
260 265 270 

TTC AGA ATA GCA GAG TTG TCT GGT AAC CGG CCC TTG ACT GTT ATC ATG 864 
Phe Arg lie Ala Glu Leu Ser Gly Asn Arg Pro Leu Thr Val He Met 
275 280 285 

CAC ACC ATT TTT CAG GAA CGG GAT TTA TTA AAA ACA TTT AAA ATT CCA 912 
His Thr He Phe Gin Glu Arg Asp Leu Leu Lys Thr Phe Lys He Pro 
290 295 300 

GTA GAT ACT TTA ATT ACA TAT CTT ATG ACT CTC GAA GAC CAT TAC CAT 960 
Val Asp Thr Leu He Thr Tyr Leu Met Thr Leu Glu Asp His Tyr His 
305 310 315 320 

GCT GAT GTG GCC TAT CAC AAC AAT ATC CAT GCT GCA GAT GTT GTC CAG 1008 
Ala Asp Val Ala Tyr His Asn Asn He His Ala Ala Asp Val Val Gin 
325 330 335 

TCT ACT CAT GTG CTA TTA TCT ACA CCT GCT TTG GAG GCT GTG TTT ACA 1056 
Ser Thr His Val Leu Leu Ser Thr Pro Ala Leu Glu Ala Val Phe Thr 
340 345 350 

GAT TTG GAG ATT CTT GCA GCA ATT TTT GCC AGT GCA ATA CAT GAT GTA 1104 
Asp Leu Glu He Leu Ala Ala He Phe Ala Ser Ala He His Asp Val 
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355 360 365 

GAT CAT CCT GGT GTG TCC AAT CAA TTT CTG ATC AAT ACA AAC TCT GAA 1152 
Asp His Pro Gly Val Ser Asn Gin Phe Leu lie Asn Thr Asn Ser Glu 
370 375 380 

CTT GCC TTG ATG TAC AAT GAT TCC TCA GTC TTA GAG AAC CAT CAT TTG 1200 
Leu Ala Leu Met Tyr Asn Asp Ser Ser Val Leu Glu Asn His His Leu 

385 390 395 400 

GCT GTG GGC TTT AAA TTG CTT CAG GAA GAA AAC TGT GAC ATT TTC CAG 1248 
Ala Val Gly Phe Lys Leu Leu Gin Glu Glu Asn Cys Asp lie Phe Gin 
405 410 415 

AAT TTG ACC AAA AAA CAA AGA CAA TCT TTA AGG AAA ATG GTC ATT GAC 1296 
Asn Leu Thr Lys Lys Gin Arg Gin Ser Leu Arg Lys Met Val lie Asp 
420 425 430 

ATC GTA CTT GCA ACA GAT ATG TCA AAA CAC ATG AAT CTA CTG GCT GAT 1344 
lie Val Leu Ala Thr Asp Met Ser Lys His Met Asn Leu Leu Ala Asp 
435 440 445 

TTG AAG ACT ATG GTT GAA ACT AAG AAA GTG ACA AGC TCT GGA GTT CTT 1392 
Leu Lys Thr Met Val Glu Thr Lys Lys Val Thr Ser Ser Gly Val Leu 
450 455 460 

CTT CTT GAT AAT TAT TCC GAT AGG ATT CAG GTT CTT CAG AAT ATG GTG 1440 
Leu Leu Asp Asn Tyr Ser Asp Arg lie Gin Val Leu Gin Asn Met Val 
465 470 475 480 

CAC TGT GCA GAT CTG AGC AAC CCA ACA AAG CCT CTC CAG CTG TAC CGC 1488 
His Cys Ala Asp Leu Ser Asn Pro Thr Lys Pro Leu Gin Leu Tyr Arg 
485 490 495 

CAG TGG ACG GAC CGG ATA ATG GAG GAG TTC TTC CGC CAA GGA GAC CGA 1536 
Gin Trp Thr Asp Arg lie Met Glu Glu Phe Phe Arg Gin Gly Asp Arg 
500 505 510 

GAG AGG GAA CGT GGC ATG GAG ATA AGC CCC ATG TGT GAC AAG CAC AAT 1584 
Glu Arg Glu Arg Gly Met Glu lie Ser Pro Met Cys Asp Lys His Asn 
515 520 525 

GCT TCC GTG GAA AAA TCA CAG GTG GGC TTC ATA GAC TAT ATT GTT CAT 1632 
Ala Ser Val Glu Lys Ser Gin Val Gly Phe lie Asp Tyr lie Val His 
530 535 540 

CCC CTC TGG GAG ACA TGG GCA GAC CTC GTC CAC CCT GAC GCC CAG GAT 1680 
Pro Leu Trp Glu Thr Trp Ala Asp Leu Val His Pro Asp Ala Gin Asp 
545 550 555 560 

ATT TTG GAC ACT TTG GAG GAC AAT CGT GAA TGG TAC CAG AGC ACA ATC 17 2 8 

lie Leu Asp Thr Leu Glu Asp Asn Arg Glu Trp Tyr Gin Ser Thr He 
565 570 575 
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CCT CAG AGO CCC TCT 
Pro Gin Ser Pro Ser 
580 

GGT CAA ACT GAG AAA 
Gly Gin Thr Glu Lys 
595 

GAG TCA GAC ACG GAA 
Glu Ser Asp Thr Glu 
610 

AGC TGC AGT GAC TCC 
Ser Cys Ser Asp Ser 
625 

GAA ATT CCC CTT 
Glu He Pro Leu 



GAG GAA AGC CAG 
Glu Glu Ser Gin 
660 

ACG ACG GGA ATT 
Thr Thr Gly He 
675 

GTC GCC ACC ATG 
Val Ala Thr Met 
690 

CCC ATC CTG GTC 
Pro He Leu Val 
705 

GTG TCC GGC GAG 
Val Ser Gly Glu 



AAG TTC ATC TGC 
Lys Phe He Cys 
740 

GTG ACC ACC CTG 
Val Thr Thr Leu 
755 

CAC ATG AAG CAG 
His Met Lys Gin 
770 

GTC CAG GAG CGC 
Val Gin Glu Arg 



CCT GCA CCT GAT 
Pro Ala Pro Asp 
585 

TTC CAG TTT GAA 
Phe Gin Phe Glu 
600 

AAG GAC AGT GGC 
Lys Asp Ser Gly 
615 

AAG ACT CTT TGT 
Lys Thr Leu Cys 
630 

GAT GAA CAG GTT GAA 
Asp Glu Gin Val Glu 
645 

CCT GAA GCC TGT GTC 
Pro Glu Ala Cys Val 
665 

CTG CAG TCG ACG GTA 
Leu Gin Ser Thr Val 
680 

GTG AGC AAG GGC GAG 
Val Ser Lys Gly Glu 
695 

GAG CTG GAC GGC GAC 
Glu Leu Asp Gly Asp 
710 

GGC GAG GGC GAT GCC 
Gly Glu Gly Asp Ala 
725 

ACC ACC GGC AAG CTG 
Thr Thr Gly Lys Leu 
745 

ACC TAC GGC GTG CAG 
Thr Tyr Gly Val Gin 
760 

CAC GAC TTC TTC AAG 
His Asp Phe Phe Lys 
775 

ACC ATC TTC TTC AAG 
Thr He Phe Phe Lys 



GAC CCA GAG GAG GGC 
Asp Pro Glu Glu Gly 
590 

CTA ACT TTA GAG GAA 
Leu Thr Leu Glu Glu 

605 

AGT CAA GTG GAA GAA 
Ser Gin Val Glu Glu 
620 

ACT CAA GAC TCA GAG 
Thr Gin Asp Ser Glu 
635 

GAG GAG GCA GTA GGG 
Glu Glu Ala Val Gly 
650 

ATA GAT GAT CGT TCT 
He Asp Asp Arg Ser 
670 

CCG CGG GCC CGG GAT 
Pro Arg Ala Arg Asp 
685 

GAG CTG TTC ACC GGG 
Glu Leu Phe Thr Gly 

700 

GTA AAC GGC CAC AAG 
Val Asn Gly His Lys 
715 

ACC TAC GGC AAG CTG 
Thr Tyr Gly Lys Leu 
730 

CCC GTG CCC TGG CCC 
Pro Val Pro Trp Pro 
750 

TGC TTC AGC CGC TAC 
Cys Phe Ser Arg Tyr 
765 

TCC GCC ATG CCC GAA 
Ser Ala Met Pro Glu 
780 

GAC GAC GGC AAC TAC 
Asp Asp Gly Asn Tyr 



CGG CAG 17 7 6 
Arg Gin 

GAT GGT 182 4 
Asp Gly 

GAC ACT 187 2 

Asp Thr 

TCT ACT 1920 
Ser Thr 
640 

GAA GAA 1968 
Glu Glu 
655 

CCT GAC 2016 
Pro Asp 

CCA CCG 2064 
Pro Pro 

GTG GTG 2112 
Val Val 

TTC AGC 2160 
Phe Ser 
720 

ACC CTG 2208 
Thr Leu 
735 

ACC CTC 2256 
Thr Leu 

CCC GAC 23 04 
Pro Asp 

GGC TAC 2 3 52 
Gly Tyr 

AAG ACC 2400 
Lys Thr 
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785 790 795 800 

CGC GCC GAG GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG 2448 
Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu 
805 810 815 

CTG AAG GGC ATC GAC TTC AAG GAG GAC GGC AAC ATC CTG GGG CAC AAG 2496 
Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys 

820 825 830 

CTG GAG TAC AAC TAC AAC AGC CAC AAC GTC TAT ATC ATG GCC GAC AAG 2544 
Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys 
835 840 845 

CAG AAG AAC GGC ATC AAG GTG AAC TTC AAG ATC CGC CAC AAC ATC GAG 2 592 
Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu 
850 855 860 

GAC GGC AGC GTG CAG CTC GCC GAC CAC TAC CAG CAG AAC ACC CCC ATC 2640 
Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He 
865 870 875 880 

GGC GAC GGC CCC GTG CTG CTG CCC GAC AAC CAC TAC CTG AGC ACC CAG 2 688 
Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin 
885 890 895 

TCC GCC CTG AGC AAA GAC CCC AAC GAG AAG CGC GAT CAC ATG GTC CTG 2736 
Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu 
900 905 910 

CTG GAG TTC GTG ACC GCC GCC GGG ATC ACT CTC GGC ATG GAC GAG CTG 27 84 
Leu Glu Phe Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu 
915 920 925 

TAC AAG TAA 2793 
Tyr Lys 
930 



(2) INFORMATION FOR SEQ ID NO: 15 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 930 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 



Met Met His Val Asn Asn Phe Pro Phe Arg Arg His Ser Trp He Cys 
15 10 15 



Phe Asp Val Asp 
20 

Met Thr Ser Pro 
35 

Ser Gin Arg Arg 
50 

Leu Ser Pro Lys 
65 

His Gly Asp Asp 

Leu Arg Thr Val 
100 

Arg Ala Pro Ser 
115 

Lys Ala Thr He 
130 

Leu Glu Glu Leu 
145 

Arg His Ser Val 

Asn Arg Glu Leu 
180 

Val Ser Glu Phe 
195 

Glu He Pro Ser 
210 

Met Ser Gin He 
225 

Thr Asn Ser Ser 

Val Leu Ala Lys 
260 

Phe Arg He Ala 
275 

His Thr He Phe 
290 

Val Asp Thr Leu 
305 

Ala Asp Val Ala 

Ser Thr His Val 
340 

Asp Leu Glu He 
355 

Asp His Pro Gly 
370 

Leu Ala Leu Met 
385 

Ala Val Gly Phe 

Asn Leu Thr Lys 
420 

He Val Leu Ala 
435 



Asn Gly Thr Ser 

Gly Ser Gly Leu 
40 

Glu Ser Phe Leu 
55 

Ser Met Ser Arg 
70 

Leu He Val Thr 
85 

Arg Asn Asn Phe 

Lys Arg Ser Pro 

120 

Thr Glu Glu Ala 
135 

Asp Trp Cys Leu 
150 

Ser Glu Met Ala 
165 

Thr His Leu Ser 

He Ser Asn Thr 

200 

Pro Thr Gin Lys 
215 

Ser Gly Val Lys 
230 

He Pro Arg Phe 
245 

Glu Leu Glu Asp 

Glu Leu Ser Gly 
280 

Gin Glu Arg Asp 
295 

He Thr Tyr Leu 
310 

Tyr His Asn Asn 
325 

Leu Leu Ser Thr 

Leu Ala Ala He 
360 

Val Ser Asn Gin 
375 

Tyr Asn Asp Ser 
390 

Lys Leu Leu Gin 
405 

Lys Gin Arg Gin 

Thr Asp Met Ser 
440 



Ala Gly Arg Ser 
25 

lie Leu Gin Ala 

Tyr Arg Ser Asp 
60 

Asn Ser Ser He 
75 

Pro Phe Ala Gin 
90 

Ala Ala Leu Thr 
105 

Met Cys Asn Gin 

Tyr Gin Lys Leu 
140 

Asp Gin Leu Glu 
155 

Ser Asn Lys Phe 
170 

Glu Met Ser Arg 
185 

Phe Leu Asp Lys 

Glu Lys Glu Lys 
220 

Lys Leu Met His 
235 

Gly Val Lys Thr 
250 

Val Asn Lys Trp 
265 

Asn Arg Pro Leu 

Leu Leu Lys Thr 
300 

Met Thr Leu Glu 
315 

He His Ala Ala 
330 

Pro Ala Leu Glu 
345 

Phe Ala Ser Ala 

Phe Leu lie Asn 
380 

Ser Val Leu Glu 
395 

Glu Glu Asn Cys 
410 

Ser Leu Arg Lys 
425 

Lys His Met Asn 



Pro Leu Asp Pre 
30 

Asn Phe Val His 
45 

Ser Asp Tyr Asp 

Ala Ser Asp He 
80 

Val Leu Ala Ser 

95 

Asn Leu Gin Asp 
110 

Pro Ser lie Asn 
125 

Ala Ser Glu Thr 

Thr Leu Gin Thr 
160 

Lys Arg Met Leu 
175 

Ser Gly Asn Gin 
190 

Gin His Glu Val 
205 

Lys Lys Arg Pro 

Ser Ser Ser Leu 
240 

Glu Gin Glu Asp 

255 

Gly Leu His Val 
270 

Thr Val He Met 
285 

Phe Lys lie Pro 

Asp His Tyr His 

320 

Asp Val Val Gin 
335 

Ala Val Phe Thr 

350 

He His Asp Val 
365 

Thr Asn Ser Glu 

Asn His His Leu 
400 

Asp He Phe Gin 
415 

Met Val He Asp 
430 

Leu Leu Ala Asp 
445 



Leu Lys Thr Met 
450 

Leu Leu Asp Asn 
465 

His Cys Ala Asp 

Gin Trp Thr Asp 
500 

Glu Arg Glu Arg 
515 

Ala Ser Val Glu 
530 

Pro Leu Trp Glu 
545 

lie Leu Asp Thr 

Pro Gin Ser Pro 
580 

Gly Gin Thr Glu 
595 

Glu Ser Asp Thr 
610 

Ser Cys Ser Asp 
625 

Glu lie Pro Leu 

Glu Glu Ser Gin 
660 

Thr Thr Gly He 
675 

Val Ala Thr Met 

690 

Pro He Leu Val 
705 

Val Ser Gly Glu 

Lys Phe He Cys 
740 

Val Thr Thr Leu 
755 

His Met Lys Gin 
770 

Val Gin Glu Arg 
785 

Arg Ala Glu Val 

Leu Lys Gly He 
820 

Leu Glu Tyr Asn 
835 

Gin Lys Asn Gly 
850 

Asp Gly Ser Val 
865 



Val Glu Thr Lys 
455 

Tyr Ser Asp Arg 
470 

Leu Ser Asn Pro 
485 

Arg lie Met Glu 

Gly Met Glu He 
520 

Lys Ser Gin Val 
535 

Thr Trp Ala Asp 
550 

Leu Glu Asp Asn 
565 

Ser Pro Ala Pro 

Lys Phe Gin Phe 
600 

Glu Lys Asp Ser 
615 

Ser Lys Thr Leu 
630 

Asp Glu Gin Val 
645 

Pro Glu Ala Cys 

Leu Gin Ser Thr 
680 

Val Ser Lys Gly 
695 

Glu Leu Asp Gly 
710 

Gly Glu Gly Asp 
725 

Thr Thr Gly Lys 

Thr Tyr Gly Val 
760 

His Asp Phe Phe 
775 

Thr He Phe Phe 
790 

Lys Phe Glu Gly 
805 

Asp Phe Lys Glu 

Tyr Asn Ser His 
840 

He Lys Val Asn 
855 

Gin Leu Ala Asp 
87C 



Lys Val Thr Ser 
460 

He Gin Val Leu 
475 

Thr Lys Pro Leu 
490 

Glu Phe Phe Arg 
505 

Ser Pro Met Cys 

Gly Phe He Asp 
540 

Leu Val His Pro 
555 

Arg Glu Trp Tyr 
570 

Asp Asp Pro Glu 
585 

Glu Leu Thr Leu 

Gly Ser Gin Val 
620 

Cys Thr Gin Asp 
635 

Glu Glu Glu Ala 
650 

Val He Asp Asp 
665 

Val Pro Arg Ala 

Glu Glu Leu Phe 

700 

Asp Val Asn Gly 
715 

Ala Thr Tyr Gly 
730 

Leu Pro Val Pro 
745 

Gin Cys Phe Ser 

Lys Ser Ala Met 
780 

Lys Asp Asp Gly 
795 

Asp Thr Leu Val 
810 

Asp Gly Asn He 
825 

Asn Val Tyr He 

Phe Lys He Arg 
860 

His Tyr Gin Gin 
875 



Ser Gly Val Leu 

Gin Asn Met Val 
480 

Gin Leu Tyr Arg 
495 

Gin Gly Asp Arg 
510 

Asp Lys His Asn 
525 

Tyr He Val His 

Asp Ala Gin Asp 
560 

Gin Ser Thr He 
575 

Glu Gly Arg Gin 
590 

Glu Glu Asp Gly 
605 

Glu Glu Asp Thr 

Ser Glu Ser Thr 
640 

Val Gly Glu Glu 
655 

Arg Ser Pro Asp 
670 

Arg Asp Pro Pro 
685 

Thr Gly Val Val 

His Lys Phe Ser 
720 

Lys Leu Thr Leu 
735 

Trp Pro Thr Leu 
750 

Arg Tyr Pro Asp 
765 

Pro Glu Gly Tyr 

Asn Tyr Lys Thr 
800 

Asn Arg He Glu 
815 

Leu Gly His Lys 
830 

Met Ala Asp Lys 
845 

His Asn lie Glu 

Asn Thr Pro He 
880 



Gly 



Asp 



Gly Pro Val Leu Leu Pro Asp 
885 



Asn His Tyr Leu Ser Thr Gin 
890 895 
Lys Arg Asp His Met Val Leu 



Ser 



Ala 



Leu Ser Lys Asp Pro Asn Glu 



Leu 



Glu 



900 905 
Phe Val Thr Ala Ala Gly lie 
915 920 



Thr Leu Gly Met Asp Glu Leu 
925 



910 



Tyr 



Lys 
930 



(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 
GTAAGCTTCG AACATGATGC ACGTGAATAA TTTTCCC 37 
(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 
GTAAGCTTCG AACATGGAGG CAGAGGGCAG CAGC 34 
(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 
GTAAGCTTCG AACATGGCTC AGCAGACAAG CCCG 34 
(2) INFORMATION FOR SEQ ID NO:159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



>2<P 3 



(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159 
GTG AATTCCC GTCGTGTCAG GAGAAGCATC ATCTATG 

(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160 
GTGAATTCAA CCATGGAGCG GGCC 

(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161 
GTGGTACCCA GTTCCGCTTG GCC 

(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162 
GTCTCGAGGC AAGATGGCTG ACCC 

C) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : S ingle 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 
GTGGATCCGA GCTCTTGACT TCGGG 2 5 

(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 
GTAAGCTTAC ATGAGCTGGT CACCTTCCCT G 31 
(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 



GTGGTACCCA TGAGGCCTGC TCCAG 
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Mocffaget PD 
f 5 OKI, 1998 

Attorney Docket No. 544 1 .003 

METHOD AND APPARATUS FOR HIGH DENSITY 
FORMAT SCREENING FOR BIOACTIVE MOLECULES 

FIELD OF THE INVENTION 

The invention relates to a method and apparatus for screening large numbers of 
molecules for biological activities. 

BACKGROUND OF THE INVENTION 

Current technology is able to generate large numbers of molecules which may possess 
potential therapeutic value. Compounds having potentially interesting biological activity may 
be products of combinatorial or traditional chemistry, a natural product, proteins isolated by 
one- or two-dimensional gel electrophoresis, or compounds secreted from or expressed by 
natural or genetically modified animal, plant, microbial or fungal cells (or parts thereof), or 
displayed by natural or genetically modified viral or phage particles. 

Screening methods have been developed which achieve very high throughputs of test 
compounds. Such methods are termed Ultra High Throughput Screening (UHTS). The 
present generation of UHTS machines rely upon essentially serial additions of test 
compounds, usually one test compound per discrete test well. Test well array densities range 
from between 96 to 3456 wells per plate. Such UHTS machines require sophisticated 
technologies to dispense microvolumes of many different fluids to selected locations, and also 
require that the detecting surface for each test molecule generally be separated from other 
detecting surfaces within the array. 

There is a need to develop a method which allows simultaneous screening of large 
numbers of test compounds for biological activity and potential therapeutic use while 
avoiding the complications associated with dispensing multiple fluid microvolumes. 

BRIEF SUMMARY OF THE INVENTION 

The invention is directed to a screening method which eliminates the need for 
delivering microfluid volumes and allows simultaneous parallel screening of large numbers of 
test compounds. Accordingly, the invention is drawn to a method for screening test 
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compounds for bioactivity, by contacting an array of test compounds with a detector layer 
capable of detecting bioactivity, wherein a cell response is indicative of bioactivity. 

The method of the invention is a high throughput system for parallel screening of a 
large number of test compounds. In one embodiment of the method of the invention, 96 to 
10,000 test compounds are simultaneously screened for bioactivity in an assay; in a more 
specific embodiment, 96 to 3456 test compounds are simultaneously screened for bioactivity. 

In a more specific embodiment, invention is drawn to a method for screening test 
compounds for bioactivity, comprising: 

(a) contacting a solid support comprising an array of test compounds with a liquid 
layer, wherein the liquid layer is in immediate contact with a detector layer and wherein each 
test compound comes into contact with a localized portion of the liquid layer; and 

(b) registering a response of the detector layer to the test compound, wherein a 
bioactive test compound is identified. 

By "high throughput screening" is meant a method able to screen large number of test 
compounds for biological activity within a given machine time (i.e. at a rate anywhere from 
1 00 to 1 00,000 compounds per hour per machine). 

The term "parallel screening" refers to a method by which very many compounds are 
applied simultaneously to the detector layer, and similarly, signals from that detector layer are 
collected contemporaneously rather than sequentially. 

By "array" is meant a regular two-dimensional arrangement of test compounds by 
which compounds are disposed at the nodes of a rectilinear grid pattern whereby a compound 
position can be identified by a simple 2-dimensional coordinate. 

A "detector layer" means any two-dimensional system which can be used to report 
biologically relevant information. In one specific embodiment of the method of the invention 
the detector layer is a monolayer of living cells loaded with a fluorescent reporter dye such as 
Fluo-3. 

By "bioactive" or "bioactivity" is meant an action or influence of a test compound 
upon the detector layer which results in a response from the detector layer that has direct 
biological significance or can be interpreted as being a biologically relevant response. 
Bioactive agents have the ability to effect physiological parameters of living cells and tissues. 
Bioactivity includes inducing or suppressing the expression of a protein, activating or 
inhibiting transcription of a gene, and/or effecting cellular function(s) such as, for example, 
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intracellular movement and storage of calcium ions, and membrane transportation. 

The capacity of a test compound to affect a detector layer, i.e. bioactivity, may be 
determined in a number of ways known to the art. In specific embodiments of the method of 
the invention, bioactivity is determined by changes or movements of fluorescent probes 
present in the detector layer which indicate changes in ionic content, cell metabolism, growth 
or viability. In a preferred method of the invention, living cells form the detector layer and 
have specific protein components tagged with a fluorescent agent, such as green fluorescent 
protein (GFP); changes in GFP fluorescence or distribution within cells indicate a particular 
cellular response which may be selected for identification of bioactivity. 

The phrase "a change in fluorescence" means any change in absorption properties, 
such as wavelength and intensity, or any change in spectral properties of the emitted light, 
such as a change of wavelength, fluorescence lifetime, intensity or polarization. 

A "solid support comprising an array of multiple test compounds" or similar terms, 
mean a fixed matrix to which test compounds have been fixed. As an example, the solid 
support of the invention includes a membrane or other surface comprising an array of printed 
test compounds. In one specific embodiment of the invention, the test compounds are 
deposited as discrete spots on a porous track-etched polycarbonate membrane 10 to 20 
microns thickness, the spots being between 10 microns to 2 mm diameter. The quantity of 
compound contained in each discrete spot will depend on the concentration of the stock 
solution from which it was derived, and the volume of that stock solution applied to the 
support. In another specific embodiment of the invention, compounds are printed onto a non- 
porous solid support which is optically clear. 

By "test compounds" is meant a fixed array of compounds to be screened for ability to 
effect physiological parameters of a cell or tissue. In one embodiment, the test compounds 
are proteins or peptides generated by combinatorial protein chemical methods known to the 
art. In another embodiment, the test compounds are chemical compounds generated by 
combinatorial chemistry methods known in the art. In another embodiments, the test 
compounds are chemical compounds which are naturally occurring compounds more or less 
purified from their native state, are the products of genetically engineered cells, or are viral or 
bacteriophage particles engineered to display compounds upon their surfaces (phage display). 

In one embodiment, the detector layer is an undemarcated area of living cells growing 
on a flat culture surface. The cells on this surface may or may not be grown to confluence, 
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may be transformed and/or engineered cells, or directly derived from animal tissues and 
grown as primary cell culture. 

In one embodiment, a test compound reaches the detector layer by diffusion through a 
porous membrane to a liquid layer immediately overlaying the detector layer. A variety of 
commercially available porous membranes are useful in the method of the invention. A 
preferred porous membrane is a track-etched polyester or polycarbonate support in which 
parallel channels of identical size are formed by a selective etching process following 
exposure of the membrane to a source of high energy ions. The method of the invention 
allows each test compound affixed to a solid support to come into contact with a limited fluid 
volume, which fluid volume is in immediate contact with the detector layer. In one 
embodiment, each test compound contacts the detector layer by diffusion through a liquid- 
containing channel directly adjacent to the detector layer. 

One advantage of the method of the invention is that it allows massive parallel 
screening of a large array of test compounds for biological activity. When living cells are the 
detector layer of the invention, they are maintained under physiologically viable conditions. 
Provision of these conditions requires the use of solutions able to supply essential nutrients 
and buffer pH changes normal to the continued growth of living cells. Such solutions may be 
complete cell culture media (i.e. any of those commercially available, for instance from Life 
Technologies Ltd.), optionally supplemented with antibiotics and serum preparations for 
optimal cell growth conditions. Buffer solutions may also be of the type known as 
"chemically defined". Cells will also require controlled temperature conditions, in the range 
20° to 37°C, and the provision of gases essential to continued cell growth and maintenance of 
buffer capacity (O2, and optionally 5% C02, depending on the type of buffer being used). 

These and other objectives, advantages, and features of the invention will become 
apparent to those persons skilled in the art upon reading the details of the method as more 
fully described below. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The foregoing features of the present invention may be fully understood from the 
following detailed disclosure of a specific preferred embodiment in conjunction with the 
accompanying drawings in which: 
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Fig. 1 is a schematic representation of the apparatus useful in one specific 
embodiment of the invention: Light from a high energy light source 1 is collected and 
collimated by unit 2, directed through a shutter assembly 3 and passes through a excitation 
filter-changer 4. A light guide 5 directs excitation light into the lensing and epi-illumination 
optics housed in unit 7. Excitation light emerging from 7 illuminates the horizontal detector 
layer located in the multi-component assembly having two solid layers 10 and 11 fixed 
relative to a supporting stage unit 8. Layer 11 is moved vertically downward on guide pins 
(17 Fig. 2b) controlled by arm 12 driven by unit 13. Four sprung contacts 14 attached to 12 
press upon the frame of layer 11 to drive it downwards as arm 12 descends. Specified devices 
(3, 4, 9, 13, 15, 16) are controlled by central processing unit 6 which issues commands and 
collects data and status information from the devices attached to it. Unit 6 includes a central 
processing unit, RAM, multi-channel serial input/output cards with onboard A/D and D/A 
converters, one of which cards controls the camera 16 and captures images from it. 

Figs. 2a-c: Figs. 2a and 2b are side view of the test stage (not to scale); Fig. 2c is a top 
view of the test stage. A supporting stage 8 has a rectangular central aperture the shape and 
size of which is the same as the area 19 of Fig. 2c. The position of stage 8 is adjusted in the 
horizontal and vertical axes by the 3-axis positioner 9. Components of the test stage shown 
include, solution layer 18, (not shown: detector layer 20 and array of test compounds 21 in 
Figs 3 and 4). The array 21 is held away from the liquid layer by pins 17 which pass through 
holes (24 in Fig. 5) in the corners of the frame 11. Arm 12 is moved down by the drive unit 
13, and the four sprung contacts 14 it bears exert pressure on the frame 1 1 moving it down 
the guide pins 17 and into close proximity to the upper surface of 10, from which it is 
separated by a thin liquid layer 18. 

Fig. 3 is a schematic showing the relative positions of the different layers in the test- 
array/detector layers used in one specific embodiment. The layers are depicted in apposition, 
as they would appear after arm 12 has pushed component 1 1 down the support pins 17. An 
array of discrete spots of test compounds 21 on a porous membrane 19 is in contact with a 
liquid layer 18 overlaying the detector layer 20 which is supported by an optically transparent 
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solid substrate 10. The compounds fill the parallel capillary spaces in the track-etched 
membrane 22. 

Fig. 4 is a schematic drawing of a second embodiment of the screening method of the 
invention. The layers are depicted in apposition, as they would appear after arm 12 has 
pushed component 11 down the support pins 17. A detector layer 20 supported on an 
optically clear porous membrane 19, and overlayed by a liquid layer 23, is placed onto an 
optically clear solid substrate 10 bearing an array of test compounds 21. The thin space 18 
between components 19 and 10 is filled with solution from 23 which has passed through the 
porous membrane 19. Bioactivity is detected by measuring changes in fluoresence of the 
detector layer resulting from responses to the diffusion of test compounds through the porous 
membrane to the detector layer. 

Figs. 5a-c are schematics illustrating transfer printing of an array of compounds onto a 
surface of a track-etched membrane. Compounds are stored in 16 separate 96-well microtitre 
plates and defined amounts are transferred simultaneously by a 96-pin printing head to the 
surface 19 (Fig. 5a). The contents of each successive 96-well plate are printed at a slightly 
offset position, generating an array after 4 such printing operations (Fig. 5b), and a full array 
of 1536 compounds after 16 printing operations (Fig. 5c). 

DETAILED DESCRIPTION 

Before the present method and solutions used in the method are described, it is to be 
understood that this invention is not limited to particular methods, components, or solutions 
described, as such methods, components, and solutions may, of course, vary. It is also to be 
understood that the terminology used herein is for the purpose of describing particular 
embodiments only, and is not intended to be limiting, since the scope of the present invention 
will be limited only by the appended claims. 

Unless defined otherwise, all technical and scientific terms used herein have the same 
meaning as commonly understood by one of ordinary skill in the art to which this invention 
belongs. Although any methods and materials similar or equivalent to those described herein 
can be used in the practice or testing of the present invention, the preferred methods and 
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materials are now described. All publications mentioned herein are incorporated herein by 
reference to disclose and describe the methods and/or materials in connection with which the 
publications are cited. 

Generally, the invention is drawn to a method for high throughput screening of test 
compounds, by contacting a solid support comprising an array of multiple test compounds 
with a detector layer, wherein each test compound comes into contact with a localized liquid 
which is in contact with a detector layer, and detecting a response of the detector layer to the 
test compound, wherein a bioactive test compound is identified. 

The high density format screening system (HDFS) of the invention, rests in part on the 
realization that the delivery of test compounds to detector surfaces can be greatly simplified 
by doing away with the need for complicated microfluidics. Test compounds are applied to 
the detector surface in a massively parallel manner, and the method is applicable to a large 
range of different types of test compounds. 

Central to the specific embodiments of the method and apparatus of the invention, 
described below, is the use of living cells as detectors, their responses being signalled via 
changes in the fluorescent or luminescent properties of various specific probes located within. 
However many different types of detector systems could be used in place of cells in such a 
system, for example, appropriate variants of Scintillation Proximity Assay (SPA) systems 
(Amersham Pharmacia Biotech) and enzyme-linked immuno-sorbent assay (ELISA) systems 
(Amersham). 

Test Compound Arrays 

The array of test compounds is formatted to have the same dimensions as the detector 
surface. In one specific embodiment of the invention, array and detector layers have a width 
of 8 cm and length of 12.5 cm, so as to fit within the format of conventional 96-well or 384- 
well microliter plates. Preparation of the test arrays will depend on their origin. 

Test compounds held in formatted arrays . Current methods for the production of 
single compounds by combinatorial methods are under development which involve 
miniaturization and patterned arrays of tethered solid-phase substrates. Thus, test compound; 
generated by combinatorial methods can be used to synthesize an array directly or indirectly 
on a carrier sheet. In one embodiment, vapor phase solubilization is used to produce a test 
compound array on the synthetic substrate, followed by a printing process of the test 
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compound array on to an absorbent membrane. In this embodiment, the test array is the 
printed membrane. An attractive feature of this method is that multiple copies of the same 
test array can be produced at one time to be screened against multiple cell systems for specific 
activities which minimizes stock handling from library archives. 

Currently most compounds to be screened come in 96-well format. However, the 96- 
well format can be altered by repeated off-set printings, to any chosen density of format that 
the transfer substrate and assay can support. The optimum density of compounds in the test 
array will depend very much on the fraction of compounds in an array which generate 
bioactive responses in the detector layer ("hit rate"). The hit rate will depend on how well the 
compound library being tested matches the targets in the assay. If the hit rate is low, e.g., 
1:20,000 - 100,000 compounds tested, a test array with center to center spacing of 200 (am 
(giving 240,000 separate compounds in a 12 cm x 8 cm area) may be preferable, providing 2 
to 10 hits per plate. At a spacing of 1 mm, 9,600 test compounds may be screened 
simultaneously. 

The density of the format may be adjusted as required without requiring any changes 
in the hardware used to perform the re-formatting; rather, adjustment may be made in the 
degree of off-set and the number of print operations used per array. 

Detection 

Fluorescent imaging provides a way to monitor physiological responses of living cells 
in a non-invasive manner. Ion- and voltage-sensitive probes, as well as the new generation of 
recombinant fluorescent probes, for instance, hybrid proteins comprising fusions of green 
fluorescent protein variants (GFPs) to cellular proteins involved in intracellular signaling, can 
be used singly or in combination to report on many aspects of cellular microphysiology. Due 
to the strong fluorescence of GFP, the luminescence of cells expressing the probes may easily 
be detected and analyzed by employing a combination of fluorescence microscopy and image 
analysis. Furthermore, these probes described are easily introduced into cells, as they can be 
expressed in the cells of interest after transfection with a suitable expression vector. 

Recombinant probes for second messengers and enzyme activity, such as kinase 
activity, are not only useful in basic research but also in screening programs aiming at 
identifying novel biologically active substances. As an example, any currently used screening 
program designed to find compounds that affect cAMP concentration and protein kinase 
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activity are based on receptor binding and/or irnmuno detection and/or reporter gene 
expression. The recombinant probes described herein, on the other hand, make it possible to 
develop an entirely new types of screening assays able to monitor immediate and transient 
changes of cAMP concentration and protein kinase activity in intact living cells. 

The HDFS method of the invention monitors the response of cell populations to test 
compounds. Lens systems are currently available which can simultaneously epi-illuminate 
and image the fluorescence from areas in excess of 8.5 x 13 cm, the size of a standard 96-well 
plate. The detection method used herein collects a variety of fluorescent signals from all cells 
in a field, with responses from discrete areas of the field being apparent in the real image of 
the fluorescence from that field as formed on the surface of the photosensitive detector 
(imaging camera). 

Delivery of Test Compounds to Detector Cells 

In a first embodiment of the method of the invention, delivery of large arrays of test 
compounds to cells is achieved with test compounds which are present on or transferred to a 
porous carrier sheet. In specific embodiments, test compounds are printed on the carrier 
sheet, and the sheet is applied (overlayed) to a field of cells of the same area. The test 
compounds reach the detector cells by diffusion through a localized buffer layer immediately 
in contact with an area of the detector cell layer. This embodiment is shown in the schematic 
of Figs. 2 & 3. 

Porous carrier sheet for delivery of test compounds : Test compound arrays are fixed 
onto the porous carrier sheet by a variety of methods known to the art. For example, an array 
of test compounds may be transferred and fixed to the carrier sheet by the method of contact 
printing, whereby an array of inert flat-ended pins (e.g. made of stainless steel) is used to 
transfer defined volumes of individual test compounds (in the range 50 nl to 2 jal) in solution 
form to discrete points on a dry carrier sheet. 

A porous membrane useful in the delivery of test compounds is a membrane 
constructed of a non-absorbent material with pores of regular and defined diameter which 
traverse the membrane directly from the upper to the lower side. The property of orthogonal 
capillarity is useful in these membranes to limit lateral spread of test compounds applied to 
the membranes as discrete spots of liquid, since it is important that the compounds remain as 
discrete spots upon the membrane. A variety of membranes of different thicknesses, 
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materials, and pore densities are commercially available from a number of manufacturers. 
For example, porous membranes useful in the method of the invention include a track-etched 
polycarbonate or polyester membrane (Corning Costar or Whatman/Polyfiltronics). These are 
available in thicknesses from 6 to 23 microns, with pores of 14 to 0.015 microns, at 100,000 
to 1,000,000,000 pores/cm 2 . For delivery of test compounds with maximum ease of handling 
and loading of test compounds, polycarbonate membranes are preferred, particularly of a 
thickness of greater than 10 microns, with pores between 1 and 10 microns diameter at 
densities of between 20,000,000 to 100,000 pores/cm 2 , respectively. One preferred 
membrane is Nucleopore® from Corning Costar. 

Alternative membranes useful for the delivery of compounds include cast cellulose 
acetate (Membra-fil® ), PTFE membranes (e.g. Filinert™), and glass fiber filters, all available 
from Corning Costar. These thicker membranes encourage lateral spread of liquid samples 
applied to their surfaces, but are thicker and could thus be used to deliver larger amounts of 
compounds. 

Track-etched and cast cellulosic membranes may also be given hydrophilic or 
hydrophobic surface treatments. It is useful to have membranes whose surfaces have defined 
wettability properties. 

When the test compound is soluble, the compound will dissolve into the buffer upon 
contact with the buffer medium, and directly contact the detector layer immediately 
underlying the buffer layer. In this embodiment, the test compounds dissolve upon contact 
with the buffer medium, and fall vertically onto the detector layer as a result of having a 
higher density than the surrounding liquor. It is generally preferred that the thin buffer layer 
between the test compound membrane and detector layer not be stirred significantly by 
convection. At the detector layer, the vertical fall of a solution of test compound is expected 
to spread radially by displacement and diffusion. The radial extent of a measured response 
may thus be use as an indicator of the bio-potency of the compounds involved. 

Test compounds of limited solubility, such as those expressed on the surface of a 
carrier system, for instance, a cell membrane, viral or phage particle, must be brought into 
very close proximity, including direct contact, with the detector layers. 

Buffer and Detector layer . The detector layer may be a continuous or non-continuous 
layer of living cells. In a specific embodiment, the detector layer is a continous cell 
monolayer corresponding in size to the test compound array. In more specific embodiments, 
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thin glass substrate, suitably tissue culture treated is preferred for fluorescent probes requiring 
excitation wavelengths below 400 nm. 

Living cells are maintained under physiologically viable conditions, as defined by 
such parameters as oxygen consumption, membrane potential, mitochondrial potential and 
cytoplasmic ion balance. Provision of these conditions requires the use of solutions able to 
supply essential nutrients and buffer pH changes normal to the continued growth of living 
cells. Such solutions may be complete cell culture media (i.e. any of those commercially 
available, for instance from Life Technologies Ltd.) optionally supplemented with antibiotics 
and serum preparations for optimal cell growth conditions. Buffer solutions may also be of 
the type known as "chemically defined ,, (e.g. phosphate buffered saline solutions). Cells will 
also require controlled temperature conditions, in the range 20° to 37°C, and the provision of 
gases essential to continued cell growth and maintenance of buffer capacity (0 2 , and 
optionally 5% C0 2 , depending on the type of buffer being used). 

Detection of bioactivity . Detection of bioactivity may be determined by a number of 
methods known in the art. In a preferred embodiment, detection of bioactivity is determined 
by cellular imaging of fluorescence. For example, imaging may be conducted of a cell layer 
on a clear glass substrate. A glass substrate having a surface pitted with a regular array of 
very shallow (approx 20 (im) depressions may be used for this purpose (Corning). This glass 
substrate is useful because it ensures a regular and defined spacing between the overlying test 
array and the cells beneath. 

In one embodiment, the detector layer is an undemarcated area of living cells growing 
on a flat culture surface. The cells on this surface may or may not be grown to confluence, 
may be transformed and/or engineered cells, or directly derived from animal tissues and 
grown as primary cell culture. In a second embodiment of the method of the invention, the 
array of test compounds is laid out onto a non-porous substrate (such as thin coverglass sheet) 
which is transparent or optically clear. Imaging will be through this surface, and through the 
cell support membrane lying above. The substrate (Fig. 4, 10) should be inert and solvent 
tolerant. For example, borosilicate glass sheets of about 200 microns thickness, which may 
be further surface-treated to give either hydrophobic or hydrophilic properties as desired. 
This embodiment is shown in the schematic of Fig. 4. 

Detector layer: In one embodiment of the invention, the detector layer is a layer of 
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living cells cultured on a thin porous membrane. A porous membrane useful in the culture 
and transfer of cells is a transparent non-absorbent membrane with pores of regular and 
defined diameter which traverse the membrane directly from the upper to the lower side. A 
porous sheet suitable for cell growth is a track-etched polyester membrane about 10 microns 
thick with pores between 0.015 and 5 microns diameter at densities of between 600,000,000 
to 400,000 pores/cm 2 repectively (Nucleopore® from Corning Costar). 

Delivery of test compounds to detector layer . The porous membrane which supports 
the detector layer, complete with the buffer medium which overlays it, is applied onto the 
(dry) test array. Buffer medium wets the lower surface of the porous membrane (Fig. 4, 19) 
and forms a continuous thin film 23 between the array of test compounds 21 and the porous 
membrane 19. Test compounds diffuse up through the pores to the detector layer above. In 
one embodiment of the invention the detector layer is a monolayer of living cells overlayed 
with physiological buffer solution. The invention includes the possibility that under some 
conditions it is desirable to have cells grow processes through the membrane to make direct 
contact with substances on the test array below, with the use of a membrane having an 
appropriate pore diameter. 

Further embodiments and general considerations . Where a test array is generated as a 
complex mixture of components, such as from the "teabag" method of combinatorial 
synthesis, or from cDNA library expression systems, a separation step may first necessary. 
Separation of test components may be conducted in any number of ways known to the art. 
In one embodiment, components may be separated by the use of one- or two-dimensional 
separation techniques in non-denaturing gels. The resulting gels may be used directly as test 
arrays. 

Specific separation methods will be tailored to the components involved. Any 
bioactive compounds from such an array would be identified from identical copies of the 
original test gel. 

Detection of Bioactivity . 

Lens and illumination system . Specialized light sources and optics are needed to 
illuminate and image the fluorescence coming from an area the size of a microtiter plate (96- 
well plate). Such a system is available from: Imaging Research Inc., St Catherines, Ontario, 
Canada, and consists of a high-power light source directed through a specialized lens which 
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acts both as a wide-field epi-illuminator and imaging device. 

An illumination system useful in the HDFS device is able to deliver excitation light 
over an area of at least 8.5 by 13 cm at an intensity sufficient to excite measurable 
fluorescence from that test field (which in most cases will be living cells loaded with 
fluorescent reporters). The illumination may come from a scanned beam, or be wide-field for 
simultaneous illumination of the entire area. The imaging system will collect fluorescent 
light from the entire test area and bring it to focus onto a sensitive imaging photodetector, 
such as a cooled CCD camera chip. 

Screening . The practice of screening large libraries of samples of unknown 
composition for the few which may contain a compound of specific biological activity is one 
of the more common methods of new drug discovery. The samples of unknown composition 
are in most cases biological material, such as plant extracts or microbial fermentation broths. 
Screening these for biological activity is normally accomplished by performing binding 
assays or, more recently, functional assays. A binding assay is an attempt to find compounds 
of interest by identifying those which adhere with some desired affinity to cells or cell 
products. This can be done using fluorescent, luminescent, or radioactive detection methods. 
These assays are based not on a biological response, but passive processes of adherence and 
displacement. They cannot be construed as functional assays or as real-time assays. Another 
way to determine biological activity is to measure up-regulation or down-regulation of 
expression of a known gene. This is done by inserting DNA which codes for something 
which can be readily measured into a cell's genome such that the expression of interest is 
coupled to expression of the inserted DNA. While this is a true functional assay, it also is not 
a real time assay. In addition, it is only capable of finding compounds which affect gene 
expression. In many cases this is not the response of interest. 

The CytoSensor described in U.S. Patent No. 4,915,812 and U.S. Patent No. 
5,395,503 is a commercial instrument which has been billed as a screening instrument. It is 
based on the detection of increased cellular proton flux by means of a semiconducting 
electrode. The instrument is applicable to high through-put screening, but can only detect 
cellular events that result in changes in extracellular pH. Again, many responses of interest 
are not associated with changes in extracellular pH. 

The growth over the last few decades in the knowledge of cellular signaling has 
presented extremely rich opportunities for new ways of screening for biologically active 
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compounds. Armed with knowledge of the biological process which one wants to affect with 
a new product, it is possible to monitor the actual process as a way of looking for compounds 
which affect it. The development of fluorescent probe molecules which upon interaction with 
intracellular signaling molecules (e.g. ions, enzymes, cyclic nucleotides) change their spectral 
properties has enabled the real-time monitoring of dynamic biological responses within living 
cells. Most of these probes can be introduced non-invasively into cells and will, depending 
on the detection system, allow characterization of cellular events in high temporal resolution 
(microseconds to seconds) and high spatial resolution (nanometers to micrometers). This 
probe technology, in combination with the technology of cellular imaging which is described 
below, has had a major impact on cell biology in that it has enabled monitoring of complex, 
cross-reacting intracellular events that could not be unravelled by conventional invasive 
biochemical techniques. 

Imaging of cellular functions using luminescent probes. Visualization of intracellular 
function using luminescent (fluorescent or bioluminescent) probes has become one of the 
mainstay techniques in modern cell biology. Using traditional optical microscopes with 
quantitative detectors in place of the human eye, both the concentration and distribution in the 
cell of a variety of intracellular molecules of interest can be measured. While luminescent 
probes can be measured in large populations of cells using other techniques, imaging is the 
only way to learn what is going on in single cells or small populations of cells. The imaging 
capabilities of the HDFS apparatus will be limited to rather low spatial resolution - 
fluorescent changes will be imaged from the entire field of detector layer up to 8cm by 12.5 
cm. When the detector layer comprises living cells, individual cells need not be resolved in 
the image, only the fluorescent signals from regions in which cells are present. 

The imaging times will vary depending on the responses and parameters being 
monitored. Signaling responses, for instance changes in the level of free calcium in cellular 
cytoplasm, may first be seen within seconds or minutes following delivery of test compounds 
to the detector layer. Such changes can be monitored by changes in the fluorescent properties 
of specific chemical probes, for instance Fluo-3 or Fura 2 may be used to report on 
cytoplasmic calcium. The way in which these changes develop within cells (time-response 
profile) is an important diagnostic feature of the signaling processes giving rise to them. 
Rapid responses are therefore recorded by sequences of images, where the time between 
images in a sequence is between 0.1 and 30 seconds (depending on the response being 
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screened for). Transcription mediated events may require minutes to hours to develop. 
Monitoring may be continuos or intermittent. For slow responses, two images can be 
sufficient to gauge the level of response, the first taken before application of test compounds, 
the second after a period during which the response is estimated to have reached its maximum 
extent. 

Controls relevant to the parameters being measured can be incorporated into the test 
arrays, both as a check for cell responsiveness and as co-ordinate markers within the arrays. 
The detector layer is continuous and undemarcated, but because of the close apposition of the 
test array to the detector layer, the center point of a response in the detector layer corresponds 
to a conjugate coordinate in the test array. It is helpful to have compounds in the test array 
which will generate known responses at known coordinates in the detector layer. Responses 
at the conjugate coordinates in the detector layer act as controls for the system's response, 
against which responses of the detector layer to unknown compounds may be compared; the 
points of response to control substances also act as reference points in the detector layer from 
which the coordinates of other responses can be mapped. For example, when bioactivity is 
determined as the ability to alter the level of free calcium in cellular cytoplasm, common 
calcium-mobilizing agonists such as carbamylcholine or adenosine trisphosphate are included 
in the test array at known coordinates. 

As another example, when a change in the cellular ratio of inherently fluorescent 
NAD(P)H/FAD is the biological parameter being assayed, metabolic inhibitors such as KCN 
or rotenone may be used as a control and marker compounds. 

In many instances, diffusion within a thin fluid layer will be involved in many 
applications of the screening method of the invention, and a concentration gradient will be 
established from each test point. Those few compounds in a test array which have bioactivity 
should be detected as spreading rings of response from the focus point of diffusion, within a 
field of the detector showing no response. The extent of the response areas (measured over 
time ), compared with those from control substances, will provide an indication of potency 
and solubility of the compound responsible, and also obviate the need to make serial dilutions 
of test compounds. Toxic or inhibitory substances may also be determined by causing blank 
sectors in response rings from known agonists. Inhibitory compounds may be determined by 
their actions on a (pre-)stimulated detector field. Detection of bioactive compounds may 
incorporate simple image processing to determine the focus, extent and potency/efficacy from 
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the areas of activity measured in a detector field. 
Apparatus 

In specific embodiments, the apparatus and method of the invention are as shown in 
Figs. 1-4. Fig. 1 shows a high energy light source 1, either a mercury or xenon arc lamp, light 
from which is collected and collimated by unit 2, directed through a shutter assembly 3 and 
passes through a excitation filter-changer 4. A high-quality light guide 5, either of fused 
quartz or a UV-compatible liquid light guide, directs excitation light into the lensing and epi- 
illumination optics housed in unit 7. Excitation light emerging from 7 evenly illuminates the 
horizontal detector layer located in the multi-component assembly labeled 10 and 11. 

Further details of this assembly are shown in Figs. 2a-c, 3, and 4. The assembly 
comprises two solid layers of which 10 is fixed relative to the stage unit 8 which supports it, 
while layer 11 is moved vertically downward on guide pins (17 in Figs. 2a,b,c) to bring test 
compounds into contact with the detector layer. Vertical movement of 11 is controlled by 
arm 12 driven by unit 13. Four sprung contacts 14 attached to 12 press upon the frame of 
layer 11 to drive it downwards as arm 12 descends. A separate drive unit 9 controls position 
of the stage 8 in the horizontal plane, and also is used to adjust focus by movement along the 
vertical axis. 

Fluorescent light emitted by the detector layer is collected by lensing unit 7, passes 
through an emission filter-changer 15 and is brought to focus on the photosensitive surface of 
an imaging detector housed in unit 16. 

Specified devices (3, 4, 9, 13, 15, 16) are controlled by a central processing unit 6 
which issues commands to, and collects data and status information from the devices attached 
to it. Collected data (images) can also be analyzed by unit 6, or passed to a subsidiary 
analysis station (not shown). Unit 6 comprises: central processing unit (Intel Pentium chip, or 
better), RAM, multi-channel serial input/output cards with onboard A/D and D/A converters, 
one of which cards controls the camera 16 and captures images from it, also a video controller 
card, VDU, and hard disk memory units. 

Figs. 2a,b,c are schematic diagrams of the test stage, which includes a supporting 
stage 8 with large rectangular central aperture, the shape and size of which is the same as the 
area labeled 19. The position of stage 8 is adjusted in the horizontal and vertical axes by the 
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3-axis positioner 9. These diagrams are drawn for the specific embodiment in which the 
detector layer is a layer of living cells growing on the upper surface of the solid transparent 
component 10, which also serves to contain the liquid layer 18 which overlays the cells in the 
detector layer and provides them with necessary nutrients and conditions to keep them alive. 
The printed array of test compounds 21 is borne on a sheet of track-etched membrane 19 held 
by a rectangular rigid frame 11. At the beginning of the screening assay, the array 21 is not in 
contact with the fluid layer 18. The array 21 is held away from the liquid layer by pins 17 
which pass through holes 24 in the corners of the frame 11 and which, by friction or "click- 
stops", prevent it from falling (Fig. 2a). At the appropriate moment, arm 12 is moved down 
by the drive unit 13 and the four sprung contacts it bears 14 exert pressure on the frame 11 
moving it down the guide pins 14 and into the liquid 18 below to a position where it is in very 
close proximity to the underlying layer of detector cells 20 grown on top of the solid substrate 
10 (Fig. 2b). Throughout this procedure, the entire area of the detector layer corresponding to 
the size and shape of area 19 is illuminated and imaged from below by the additional 
apparatus shown in Fig. 1. 

The apparatus can also be used in a second embodiment of the screening method of 
the invention, where the test array is laid out on the upper surface of component 10, and 
components 11 and 19 are a frame and thin transparent track-etched membrane, respectively. 
In this specific embodiment, the frame 11 is sufficiently deep to contain culture liquid as 
required to sustain the detector layer of living cells growing on the upper surface of the 
membrane 19. 

Figs. 3 and 4 are schematics to show the relative positions of the different layers in the 
test-array/detector layers used in the specific embodiments of the invention. Fig. 3 shows the 
arrangement in which an array of discrete spots of test compounds 21 on a porous membrane 
19 is in contact with a liquid layer 18 overlaying the detector layer 20 which is supported by 
an optically transparent solid substrate 10. The compounds fill the parallel capillary spaces 
22 in the track-etched membrane 19. Bioactivity is detected by measuring changes in 
fluorescence in the detector layer 20 resulting from responses to the diffusion of test 
compounds through the porous membrane to the detector layer. 

Fig. 4 is a schematic drawing of a second embodiment of the screening method in 
which a detector layer 20 supported on an optically clear porous membrane 19, and overlayed 
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by a liquid layer 23, is placed onto an optically clear solid substrate 10 bearing an array of test 
compounds 21. The thin space 18 between components 19 and 10 is filled with solution from 
23 which has passed through the porous membrane 19. Bioactivity is again detected by 
measuring changes in fluorescence of the detector layer resulting from responses to the 
diffusion of test compounds through the porous membrane to the detector layer. 

Fig. 5 is a schematic illustrating the way in which an array of 1536 compounds can be 
created on a membrane surface, such as would be useful in the first embodiment described 
above, by simple transfer printing. Compounds are stored in 16 separate 96-well microtiter 
plates and defined amounts are transferred simultaneously by a 96-pin printing head to the 
surface 19. The contents of each successive 96-well plate are printed at a slightly offset 
position, generating an array as shown in Fig. 5b after 4 such printing operations, and a full 
array of 1536 compounds (Fig. 5c) after 16 printing operations. The holes 24 in frame 11 are 
used to position and guide the completed array on the pins 17 indicated in Figs. 2b and 2c. 
The process illustrated in Fig. 5 can also be used to transfer an array of test compounds to a 
solid surface such as would be useful for component 10 in the second embodiment of the 
method described above. 

EXAMPLE 

Example 1. Screening of 1536 Test Compounds for Bioactivity. 

The following description of the use of one embodiment of the apparatus of the 
invention in the screening method disclosed. An array of test compounds are supplied in 96- 
well microtiter plates, as is common practice for compounds produced by methods commonly 
known as combinatorial chemistry, or for compounds extracted from natural sources. In this 
example, the compounds are provided in soluble form, and the concentrations and solvents 
used have previously been tested for compatibility with the apparatus. In this example, 1536 
compounds are tested simultaneously against a known cellular target, specifically a G-protein 
coupled receptor (GPCR) of the Gq type expressed in a transformed cell line. Gq GPCRs 
give clearly identifiable changes in intracellular calcium when activated. 

First, physiologically viable living cells are cultured to a near confluent monolayer in 
a transparent culture dish (10, Fig. 2a-c) in appropriate culture medium and conditions. 
Immediately prior to being used in the experiment, the cells are loaded with the fluorescent 
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indicator of free cytoplasmic calcium concentration, Fluo-3 (from Molecular Probes, 
Oregon). This is accomplished by incubating the cells with a 2 to 5 jiM solution of Fluo-3 
acetoxymethyl ester (AM) for a period of 10 to 15 minutes, followed by a series of solution 
exchanges to wash away excess Fluo-3 AM. 

The method of transfer of compounds to the track-etched membrane Fig. 2a-c 19 is 
illustrated in Fig. 5. In this example, 1536 compounds are printed as an array 21 on a single 
track-etched membrane 19, from sixteen individual 96-well microtiter plates in the following 
manner: A 96-pin printing head is used to transfer defined volumes of compounds (in the 
range 0.05 to 0.5 |il of each compound), one compound per pin, from each 96-well plate in 
turn (with wash steps between source plates to avoid cross-contamination). Each 96-point 
print to the membrane occurs in an offset grid, such that 16 print operations are made 
sequentially on the same membrane and the printed spots of compounds remain discrete and 
separated from each other (three of these spots are indicated in Fig. 5a, 21). Fig. 5a shows the 
result of a single 96-point print operation, Fig. 5b after four such operations, and Fig. 5c the 
finished array after 16 print operations. In this way, just sixteen print operations (and sixteen 
intermediate wash steps for a single print head) are sufficient to transfer 1536 compounds to a 
single test array. The procedure can be readily automated, and multiple copies of each printed 
sheet made for multiple tests. 

Completed arrays are fixed to the pins 17 (Figs. 2b-c) projecting from the culture dish 
10 such that they are supported some small distance above the thin fluid layer 18 covering the 
living cells which form the detector layer. Once the test array is fixed in place over the Fluo- 
3-loaded cells, the entire assembly is placed onto the test stage as shown in Fig. 2a. 

The following events are synchronized by sequential instructions from the computer 
processing unit 6. First, the test stage is centered over the lensing unit 7 (Fig. 1) and the 
detector layer it supports is brought into focus by the motor unit 9. Fluo-3 is excited by light 
of 490 nm, and its fluorescent emissions are collected in the range 505-540 nm. The intensity 
of emission is increased when the dye binds free calcium. Thus the computer brings a 490 
nm band-pass excitation filter into line of the light path coming from units 1 and 2 using the 
filter changer unit 4. At the same time, a band-pass emission filter for the range 505-540 nm 
is positioned in the imaging path by unit 15. The shutter 3 is opened for a pre-determined 
exposure period (typically 50 to 500 milliseconds), and during this time the whole area of the 
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detector layer is illuminated with 490 nm light. Fluorescent emission from the Fluo-3 in the 
cells is collected by the lens 7 and focused into the camera. The camera captures the image 
and sends it to the processing unit 6 where it is stored and displayed. At regular intervals 
thereafter, images are captured in sequence by repeatedly opening the shutter 3. Intervals 
between successive images are typically in the range 0.5 to 30 seconds, depending on the 
speed of the response expected. Intervals of 0.5 to 2 seconds are usual and sufficient to 
sample the dynamics of most changes in cellular calcium. At a predetermined time during 
this continuing sequence of images, the test array is pushed down the guide pins 17 by the 
actuating arm 12 and its sprung contacts 14, driven by unit 13. In close apposition to the cells 
in the detector layer, the test array begins to release the compounds it carries. The 
compounds dissolve into the the liquid layer, and these fall vertically downwards onto the 
cells below. Because there is only a thin liquid layer between the membrane of the test array 
and the cells below, there is insignificant intermixing of adjacent test compounds. If a test 
compound activates cells below it bearing Gq GPCRs, these cells will respond with an 
immediate increase in free cytoplasmic calcium, and the fluorescence signal from the Fluo-3 
dye they contain will increase. The sequence of images collected during the period of the 
response (which is typically of 1 to 10 minutes duration) will reveal which cells have so 
responded, and their position in the area of the detector layer will be correlated with the 
identity of the compound in the array above. An analysis of the entire area of each image in 
the sequence, performed on-line by the processing unit 6, yields the following information: 
the identity of the compound eliciting the response, the profile of the response with time, the 
intensity of the response, and also the potency of the compound with reference to a chosen 
standard. The latter information is contained in the radius of the area of cells responding 
within a particular time, and can be compared directly to a known standard which is included 
in the array at known points. The use of standard compounds at known points in the array 
also provides a control for the experiment, and helps to identify coordinates in the detector 
layer from which other responses can be mapped. 

At the end of the screening assay, the sequence of images is stopped, the actuating 
ami 12 raised, and the test assembly removed. The next assembly is then moved in and the 
sequence begun afresh. Assembling the test units and exchanging them on the test stage can 
be automated by appropriate robotic control (not shown in the diagrams). 
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One of the advantages of the method of the invention is that the method does not 
require that either the components of the detector layer (e.g. living cells), or the different test 
compounds, be isolated from one another within discrete chambers or compartments, as is 
common to all high throughtput screening procedures currently in use or development. The 
method also removes the need to dispense microvolumes of test compounds during the period 
of the assay itself. Delivery of test compounds to detector layers is either by direct contact or 
by simple diffusion across thin liquid films. Delivery and detection becomes a (massively) 
parallel process. 
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CLAIMS 

What is claimed is: 

1. A method for screening test compounds for bioactivity, comprising: 

(a) contacting an array of test compounds with a detector layer; and 

(b) detecting a detector layer response, wherein a response is indicative of bioactivity. 

2. The method of claim 1, wherein the detector layer is comprised of physiologically 
viable cells. 

3. The method of claim 2, wherein the detector layer is supported by an optically clear 
substrate. 

4. The method of claim 3, wherein the reactive sensing surface is held stationary in the 
field of view of the optical detector and the sample surface is moved into contact with it 
during the course of the measurement. 

4. The method of claim 1, wherein the detection of step (b) is a change in a fluorescence or 
luminescence property of the cell. 

5. The method of claim 4, wherein detection is determined with an illumination system 
capable of exciting the fluorescence of the reactive surface with any of a number of 
previously selected wavelengths with defined order and of defined time duration. 

6. The method of claim 2, wherein the physiologically viable cells form a monolayer. 

7. The method of claim 1, wherein the test compounds are generated on a solid support 
by combinatorial chemistry. 

8. The method of claim 1 , wherein the test compound array is generated by one- or two- 
dimensional gel electrophoresis. 
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9. A method for high throughput screening of test compounds for bioactivity, 
comprising: 

(a) contacting a solid support comprising an array of multiple test compounds with a 
cell layer, wherein each test compound comes into contact with a localized liquid which is in 
contact with a detector layer; and 

(b) detecting a response of the detector layer to the test compound, wherein a 
response is indicative of a bioactive compound. 

10. A method for simultaneously exposing an array of test compounds with a reactive 
sensing surface, comprising the steps of: 

(a) contacting an array of test compounds on a solid matrix with a porous membrane 
which is in contact with a liquid layer overlaying a reactive sensing surface layer; and 

(b) allowing the test compounds to diffuse through the porous membrane to the liquid 
layer overlaying the reactive sensing surface. 

11. An apparatus for screening an array of test compounds for bioactivity, comprising: 

(a) a solid support comprising an array of test compounds; 

(b) a porous membrane; and 

(c) a detector layer layer, wherein a liquid layer is between the porous membrane and 
detector layer layer, and wherein the test compounds contact the detector layer layer by 
diffusion through the porous membrane. 
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METHOD AND APPARATUS FOR HIGH DENSITY 
FORMAT SCREENING FOR BIOACTIVE MOLECULES 



Abstract 

A method and apparatus for screening an array of test compounds for 
bioactivity by contacting an array of test compounds with a detector layer capable of detecting 
bioactivity, and detecting a detector layer response. The detector layer is comprised of 
physiologically viable cells. The method and apparatus allow a large number of test 
compounds to be simultaneously assayed in parallel. 
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Top view of test stage; not to scale 
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3-D sectional representations of portions of 
the test-array/detector layers: not to scale 
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ABSTRACT 

A novel method to monitor changes in intracellular cAMP concentration ([cAMP],) 
within intact living cells has been developed based on a fusion of the catalytic subunit of 
cAMP-dependent protein kinase to green fluorescent protein (GFP). In stably 
transfected unstimulated fibroblasts, fusion protein fluorescence was highly 
concentrated in aggregates throughout the cytoplasm and absent in the nucleus. 
Stimulation with the adenylate cyclase activator forskolin caused the release of tagged 
catalytic subunits from the cytoplasmic aggregates within minutes, resulting in an 
increasingly homogeneous distribution of GFP fluorescence throughout the cytoplasm. 
The observed redistribution was completely reversible: removal of forskolin led to the 
return of fluorescence to the cytoplasmic aggregates. Spot-photobleach measurements 
showed that the rate of exchange of GFP-labelled catalytic subunits at these aggregates 
increased in proportion to [cAMP] ( . The localisation of the fusion protein was also 
sensitive to receptor stimulation. In fibroblasts stably expressing the G s -protein coupled 
glucagon receptor, generation of an increased [cAMP] i through glucagon stimulation 
resulted in a redistribution of tagged catalytic subunit similar to that observed after 
forskolin addition. Conversely, in fibroblasts overexpressing the G,-protein coupled a2a 
adrenoreceptor, addition of norepinephrine after forskolin stimulation led to a reversal 
of the fusion protein redistribution. 

INTRODUCTION 

The cAMP-dependent protein kinase (cAK) 1 is a ubiquitous serine/threonine protein 
kinase. cAK is recognised as the only mediator of intracellular cAMP signals in 
eukaryotes : , with the exception of certain ion channels 3 . The cAK holoenzyme is an 
R,C, tetramer consisting of a regulatory (R) dimer and two catalytic (C) subunits 2 . 
Presently, four isoforms of the regulatory subunit (RIa, Rip, Rlla and RIIp) and three 
isoforms of the catalytic subunit (Ca, C(3 and Cy) have been described 2 . Splice variants 
of Ca and Cp 4 and possible R heterodimers, as reported for RIa and Rip 5 , add to the 
complexity of the cAK holoenzyme. Although the Cy isoform is unique with respect to 
substrate specificity, inhibition and tissue distribution 6 , few reports suggest different 
roles for Ca and CP isoforms of the catalytic subunit 7 . In contrast, the RI and RII 
subunits are reported to be distinct. The cAKI (RI,C 2 ) holoenzyme is thought to be 
mainly soluble and cytoplasmic 11 although RI is reported to be associated with 
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sarcoplasmic membranes 8 and also with a detergent-resistant structure in mammalian 
sperm 9 . cAKII (R1I 2 C : ) on the other hand is thought to be particulate and RII has been 
reported to bind to a number of intracellular components, most notably Golgi 
membranes 1011 and centrosomes 1011 but also mitochondrial nuclei 1314 and cytoskeletal 
components"'". RII subunits interact with a family of proteins called A-kinase 
anchoring proteins (AKAP) 1 " and this may also be true of RI subunits 16 . The AKAP-RII 
subunit interaction is presumed to be responsible for localising the cAKII tetramer at 
these intracellular sites. The NH 2 -terminus of the C subunit is myristoylated 17 , a post- 
translational modification usually associated with membrane insertion. However, the C 
subunit does not appear to be membrane attached and while myristoylation may increase 
the thermostability of the protein, the possible role of myristoylation in its targeting or 
substrate specificity is still not clear 18 . 

The C subunit in the assembled tetramer is believed, although not unanimously 19 , to 
be catalytically inactive. Activation of cAK is physiologically mediated through G s - 
protein coupled plasma membrane receptors. G s -protein activation leads to activation of 
adenylate cyclases, which generate cAMP. Binding of two molecules of cAMP to each 
R subunit causes the release and activation of the C subunits. Dissociated C subunits 
phosphorylate cytoplasmic substrates 20 21 and have been shown to relocalise to the 
nucleus 22 . The nuclear redistribution mechanism of C subunits may be by simple 
diffusion through nuclear pores 21 . To date a large number of cytoplasmic and a few 
nuclear cAK substrates have been reported. An incomplete list of 25 in vitro substrates 23 
includes several enzymes involved in basic metabolism such as phosphorylase kinase, 
glycogen synthase and fructose bisphosphatase. Nuclear C subunit regulates 
transcription of genes under control of the cAMP response element (CRE) by 
phosphorylating the continuously bound CRE binding protein, (CREB ) 24 ' 2 ~\ 

Several factors decrease the level of cAK activity. Stimulation of plasma membrane 
bound G ( -protein coupled receptors inhibits adenylate cyclases and cAMP is 
continuously being broken down by a variety of phosphodiesterases. Despite the 
importance of the cAMP/cAK signalling pathway, there is no easy method to monitor 
intracellular cAMP concentrations ([cAMP],) in intact living cells. The current method 
of choice involves fluorescence resonance energy transfer (FRET) between 
microinjected fluorescently labelled R and C subunits 26 . In the work described herein, 
the Ca subunit was tagged with a highly fluorescent variant of green fluorescent protein 
(GFP) containing F64L and S65T amino acid substitutions (GFP L1 ) (International 
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Publication No. W097/1 1094). This approach provides a transfectable probe for 
monitoring the intracellular trafficking of C subunits in response to changes in [cAMP] i 
and represents the first easy method to evaluate changes in [cAMP], in intact living cells 
in response to extracellular signals. 



Results 

GFP^ T tagged C had the expected molecular weight 

Lysates of glucagon receptor-transfected baby hamster kidney cells (BHK/GR) stably 
expressing the C-GFP LT fusion protein were characterised by Western blot analysis 
using polyclonal antibodies directed against the NH r terminus of Ca (Fig. 1). In a 
separate experiment, lysates of BHK cells, transiently expressing either of the two 
fusion proteins, were characterised by Western blot analysis using polyclonal antibodies 
that recognise GFP (data not shown). Taken together, these experiments show that C- 
GFP LT fusion protein is recognised as a unique protein of the expected size by the anti- 
Ca antibody in stably transfected cells and that both fusion proteins have the same 
molecular weight. 

The fusion protein localised to cytoplasmic aggregates. 

The localisation of the two fusion proteins, when transiently expressed in Chinese 
hamster ovary (CHO) cells, was very different. While GFP LT -C was evenly distributed 
throughout the cytoplasm (Fig. 2A), C-GFP LT was found in highly fluorescent 
aggregates in the cytoplasm (Fig. 2B). These distinct patterns for the two fusions was 
also seen in transiently transfected human embryonic kidney (HEK293) and BHK/GR 
cells (data not shown). For unknown reasons it was not possible to make stable 
transfectants expressing the GFP LT -C fusion, whereas this procedure was straightforward 
with the C-GFP LT fusion. The distribution of GFP LT -C in transiently transfected CHO 
cells did not change when [cAMP], was raised by the addition of 50 forskolin (n=6, 
data not shown). The following results are therefore based only on work with the C- 
GFP LT fusion. 
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Increased [cAMP]j caused the release of fusion protein from cytoplasmic 
aggregates. 

Within 2-3 minutes of treatment of CHO/C-GFP u cells with forskolin, C-GFP LT 
fluorescence dispersed from the bright aggregates and filled the cytoplasm (Fig. 3A, 1 
jaM forskolin), remaining in this distribution for as long as forskolin was present (cells 
were followed up to two hours). The probe did not enter the nuclear compartment to any 
clearly observable extent. Higher doses of forskolin increased the rate and extent of 
probe redistribution. The responses depicted in Figure 3B-G have all been quantified 
from image data, as described in the experimental protocol. Table 1 gives a comparison 
of the average temporal profiles of fusion protein redistribution in response to the three 
forskolin concentrations shown in Figure 3B. Addition of 1 mM dibutyryl cAMP 
(dbcAMP) (n=6), a membrane permeable cAMP analogue, which is not degraded by 
phosphodiesterases, caused a similar but slower response (Fig. 3C). Addition of 100 |iM 
3-isobutyl-l-methylxanthine (IBMX) (n=4), a cell permeable phosphodiesterase 
inhibitor, caused a similar, slow response (Fig. 3D), even in the absence of adenylate 
cyclase stimulation. Addition of buffer (n=2) had no effect (data not shown). As a 
control for the behaviour of the fusion protein, GFP LT alone was expressed in CHO cells 
and these also given 50 jaM forskolin (n=5); the uniform diffuse distribution 
characteristic of GFP in these cells was unaffected by such treatment (data not shown). 

To test the reversibility of the fusion protein redistribution, CHO/C-GFP LT cells were 
treated with 10 \jM forskolin (n=2) and washed repeatedly (5-8 times) with 37°C buffer. 
Although the plant terpenoid forskolin is lipophilic, it is possible to remove its effect by 
washing with aqueous buffer". In these experiments, fusion protein began to return to its 
prestimulatory localisation within 2-3 min (Fig. 3E). In fact the fusion protein returned 
to a pattern of fluorescent cytoplasmic aggregates virtually indistinguishable from that 
observed before forskolin stimulation. To test whether the return of fusion protein to the 
cytoplasmic aggregates reflected a decreased [cAMP],, cells were treated with a 
combination of 10 forskolin and 100 )iM IBMX (n=2); when washed repeatedly (5- 
8 times) with 37°C buffer containing 100 |uM IBMX the fusion protein did not return to 
its prestimulatory localisation after removal of forskolin (Fig. 3E). 

To test the probe's response to receptor activation of adenylate cyclase, stably 
transfected BHK/GR,C-GFP LT cells were exposed to glucagon stimulation. In these 
cells, addition of 100 nM glucagon (n=2) caused the release of C-GFP LT from the 
cytoplasmic aggregates and a resulting permanent redistribution of the fusion protein to 
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a more even cytoplasmic distribution within 2-3 min (Fig. 3F). Similar but less 
pronounced effects were seen at lower glucagon concentrations (n=2, data not shown). 
Addition of buffer (n=2) had no effect over time (data not shown). CHO/C-GFP LT cells, 
transiently transfected with the a2a adrenoreceptor (ARa2a), were treated with 10 \jM 
forskolin then, in the continued presence of forskolin, exposed to 10 jaM norepinephrine 
to stimulate the exogenous adrenoreceptors. This treatment led to reaggregation of C- 
GFP LT within the fluorescent structures, consistent with a receptor-induced decrease in 
[cAMP], (Fig. 3G). 

Rate of recovery from photobleach of C~GFP^ T aggregates is dependent 
on forskolin concentration. 

Photobleach measurements were made to confirm that changes seen in the distribution 
of C-GFP LT fluorescence were a result of changes in the rate of turnover of C-GFP LT 
upon the aggregates. The fluorescence of an entire C-GFP LT aggregate within a cell 
could be effectively bleached within 2 to 5 seconds by a stationary laser beam at full 
intensity. After bleaching, aggregates recovered their fluorescence, indicating a dynamic 
exchange of C-GFP LT at these loci (Fig. 4A). The rate of recovery from spot photobleach 
was highly reproducible at each particular concentration of forskolin even in different 
cells (Fig. 4B). Both the extent and rate of recovery increased with the forskolin 
treatment given. Most recovery curves required at least two exponentials to fit them 
adequately. Given the limits of the experimental procedure, the curves are used here 
only to estimate half-times of recovery. To an approximation, half times for recovery 
can be estimated directly from the slope of reciprocal plots of the fluorescence 
displacement for the first few time points 27 . Values for half times estimated within the 
first 3.0 seconds of recovery (Fig. 4C) are plotted as a dose response curve in Figure 5, 
giving an estimated l^-maximal concentration for forskolin of about 3 \iM 



Fusion protein redistribution correlated with [cAMP]} 

As described above, the time it took for a response to come to completion was inversely 
related to the forskolin dose (Table 1). In addition the extent of a response was also dose 
dependent. In an automated imaging system we stimulated CHO/C-GFP LT cells with 5 
increasing doses of forskolin (n=8). Images were analysed with the same algorithm used 
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to construct Figure 3B-G. From the results shown in Figure 5, a half maximal 
stimulation was observed at 1.7 \aM forskolin by this method. In parallel, CHO/C-GFP 1 ' 1 
cells were stimulated with 8 increasing concentrations of forskolin (n=N) and the 
relative amount of cAMP produced was measured in a scintillation proximity assay 
(SPA). The l /2-maximal concentration for forskolin in the SPA assay was determined to 
be 9.3 nM (Fig. 5). 

Co-localisation of C-GFP LT with labelled ceramide distributions 

Figure 6A is an overlay of green and red fluorescence emissions from CHO/C-GFP LT 
cells stained with BOD1PY® FL C 5 -ceramide (ceramide-FL). The green channel 
contains the ceramide-FL and GFP LT fluorescence; the red channel shows only the 
ceramide-FL excimer emission. The ceramide-FL probe preferentially accumulates in 
Golgi membranes 28 . This is most obvious in images formed from the red excimer 
emissions of the FL-ceramide. The GFP LT -bright structures do not stain with the 
ceramide probe indicating that they are clearly distinct from Golgi membranes. 

Structure of the GFF^ r -bright aggregates 

Figure 6B shows an iso-surface rendering of 25 deconvolved and reconstructed through- 
focus wide-field images of a single large C-GFP LT aggregate. Each aggregate appears to 
have the structure of a convoluted tubule or glomerulus, and this is more obvious in the 
stereo pair (Fig. 6C) derived from the same data set from which the iso-surface 
rendering was made. It is not completely clear whether each structure is formed from a 
single fully connected tubule or a small number of discrete tubules in close apposition. 
The structure is however clearly compact and more complex and structured than a 
simple amorphous aggregation of C-GFP LT molecules. Figure 6B-C is typical of the 
larger aggregates which are of the order of 2 to 4 jam across. The more numerous 
smaller aggregates (less than 1 jam across) appear to share the same underlying 
structural component(s) as their larger counterparts. 
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Discussion 

The aim of the present study was to develop a transfectable probe for monitoring 
changes in [cAMP],. Since cAK is by far the major intracellular effector for cAMP 2 , a 
measure of its activation should closely reflect physiologically relevant changes in 
[cAMP], 

NH 2 - and COOH-terminal fusions of C subunit were made to a highly fluorescent 
variant of GFP. Only the C-GFP LT fusion responded to changes in [cAMP],. The three- 
dimensional structure of the C subunit 29,30 reveals that both the NH 2 - and COOH-termini, 
while far apart, are both located opposite the catalytic cleft and close to the surface of 
the protein. Comparison with the closely related cGMP-dependent protein kinase, 
whose R and C subdomains are contained within the same polypeptide chain in R-C 
order 31 , suggests that the R subunit of cAK may be expected to interact with the NH 2 - 
terminal region of the C subunit. Furthermore, the surface of the C subunit in the NH 2 - 
terminal region is hydrophobic 29 , supportive of a protein-protein interaction in this area. 
An NHU-terminal GFP LT tag would also prevent post-translational myristoylation (of the 
NH,-terminus) of the C subunit as reported specifically for mouse Ca 18 , while the C- 
GFP LT fusion may well be myristoylated. These factors may explain the very different 
behaviours of the NH : - and COOH-terminal fusions of C subunit to GFP L \ 

There are reasons to believe, that the C-GFP LT fusion protein behaves like the 
endogenous kinase both with regard to localisation and activation kinetics. Li et al 
(1996)" have, for instance, reported that RII subunits occur as "intensely fluorescent 
spots" within perinuclear cytoplasm. Skalhegg et al (1997) 32 also reported a granular 
distribution of RII in both human B and T lymphocytes. Also, the time frame of fusion 
protein redistribution in response to forskolin addition reported here, corresponds well 
to the observation of dissociation of microinjected RIa 2 Ca 2 holoenzyme in response to 
forskolin within 1-2 minutes 26 and the dissociation of endogenous RII 2 C : in response to 
forskolin observed by immunofluorescence after less than 5 min 2 ~. 

In contrast with previous work with microinjected RIla 2 Ca 2 holoenzyme and Ca 
subunit 21 , we did not observe any translocation of C-GFP LT to the nucleus. A possible 
explanation could be the increased size of the fusion protein relative to endogenous C 
subunit. Nuclear pores are thought to allow passage by diffusion of globular proteins of 
less than 45-60 kDa 33 . The putative size limit of 45-60 kDa may adequately explain the 
exclusion of the fusion protein (68 kDa), yet passage of endogenous C subunit (41 kDa). 
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Consistent with this, a microinjectcd 65 kDa fusion protein of glutathione S-transferase 
and mouse Ca subunit (GST-C) was excluded from the nucleus 21 . 

That the C-GFP L1 fusion can be released by dbcAMP or treatments which increase 
[cAMP], suggests that it must recognise and attach to endogenous R subunits (or some 
subset of the same) and therefore that these R subunits are naturally collected at or on 
the structures seen in Figures 3A and 6. Reversal of elevated [cAMP] ( , e.g. by removal 
of forskolin or stimulation of G-coupled receptors, results in rapid return of 
fluorescence to the original prestimulatory locations within cytoplasm. These anchoring 
structures therefore appear to be persistent features within the cytoplasm of CHO/C- 
GFP LT cells. Similar structures and C-GFP LT behaviour were also found in transfected 
BHK cells. 

The distribution of fluorescence between aggregates and cytoplasm should reflect 
the position of a dynamic equilibrium within each cell, determined principally by 
[cAMP] ( . This is confirmed by results from spot-photobleach measurements. The rate of 
fluorescence recovery of aggregates following photobleach measures the net rate of 
turnover of C subunits at these sites. The rate of recovery is the sum of on and off rates 
for the association of catalytic with regulatory subunits at these loci, both of which will 
be governed principally by the concentration of cAMP within the cell (the off rate being 
governed directly by [cAMP],; the on rate being dependent on the concentration of free 
C-GFP LT in the cytoplasm). Most aggregates completely disappear after full stimulation 
with forskolin. However, often one aggregate remains, and this is always the biggest and 
brightest from the unstimulated cell. Nevertheless, as photobleaching can demonstrate, 
there is active turnover of C-GFP LT even at these large fluorescent aggregates which 
remain in fully stimulated cells. As a further observation, there appears to be 
considerable mobility of catalytic subunits within the structure of an aggregate, since a 
stationary laser beam (approx. 0.5-1.0 jam diameter) is able to bleach fluorescence from 
an entire aggregate of 2-3 jam diameter in 2 to 5 seconds. 

The lack of colocalisation of C-GFP LT and ceramide fluorescence, the position of 
aggregates within the cell and their unusual form, suggest that these structures are 
definitely not associated with Golgi, but may well be constructed of membrane tubules 
with C-GFP L1 on the outer surface. Although we have been unable as yet to ascertain the 
identity of these structures, we have ruled out Golgi membranes. They may however be 
membranous since fusion protein is apparently freely mobile on them, possible tubular 
judging by the 3-D recontructed image, and clearly the catalytic subunits are able to 
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bind to and release from R subunits with ease, suggesting that the latter are anchored to 
the surface of these structures. They are also persistent within the cytoplasm, and found 
in all cells transfected thus far with the C-GFP LT construct (CHO, HEK293 and BHK). 

Figure 5 gives a comparison of an SPA assay conducted in parallel with two 
different forskolin dose response experiments using the cAK fusion protein. These 
experiments showed a direct correlation of three parameters: level of [cAMP],, turnover 
rate of C-GFP LT at cytoplasmic aggregates, and overall degree of fusion protein 
redistribution. Data from these three greatly varying methods agree on an V^-maximal 
concentration for forskolin of between 1.7 to 9.3 [xM in this system. As these results 
show, the cAK fusion protein represents a novel and reliable probe by which dynamic 
changes in [cAMP], can be measured in intact living cells as they respond to 
extracellular signals. 



Experimental protocol 

Hybrid cDNA construction 

Hybrid cDNAs encoding NH 2 - and COOH-terminal fusions of murine Ca subunit 34 to 
GFP LT were inserted into the multiple cloning site of the pZeoSV (Invitrogen Corp., San 
Diego, CA, USA) mammalian expression vector, generating the fusion constructs C- 
GFP LT and GFP LT -C. Briefly, cDNAs encoding C and GFP LT were amplified by PCR 
using the following primers: 5'-C, 

TTGGACACAAGCTTTGGACACCCTCAGGATATGGGCAACGCCGCCGCCGCC 

AAG; 3 '" C ' 
GTCATCTTCTCGAGTCTTTCAGGCGCGCCCAAACTCAGTAAACTCCTTGCCA 

cac ; 5 '- GFpL1 < 

TTGGACACAAGCTTTGGACACGGCGCGCCATGAGTAAAGGAGAAGAACTTT 

TC and 3'-GFP L \ 

GTCATCTTCTCGAGTCTTACTCCTGAGGTTTGTATAGTTCATCCATGCCATGT 
Hindlll/AscI restriction endonuclease digested C subunit PCR amplification product 
and Ascl/Xhol digested GFP LT PCR product were ligated with the Hindlll/Xhol digested 
vector for the generation of the C-GFP lT fusion construct. Correspondingly the GFP LT -C 
construct was generated by ligating HindIlI/Bsu36I digested GFP LT PCR product and 
Bsu36I/XhoI digested C subunit PCR product with the Hindlll/Xhol digested vector. To 
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generate a similar construct which allowed the expression of GFP LT alone, the GFP 1 ' 1 
PCR product was digested with Hindlll/Xhol and ligated with the Hindlll/Xhol 
digested vector. 

Cell cultures 

CHO cells were transfected with the vectors containing hybrid cDNA for the C-GFP LT 
or the GFP LT -C fusion proteins using the calcium phosphate precipitate method in 
HEPES-buffered saline 35 . Stable transfectants were selected using 1000 Zeocin/ml 
(lnvitrogen) in the growth medium (DMEM with 1000 mg glucose/1, 10 % foetal bovine 
serum (FBS), 100 jag penicillin-streptomycin mixture ml 1 , 2 mM L-glutamine 
purchased from Life Technologies Inc., Gaithersburg, MD, USA). Untransfected CHO 
cells were used as the control. To assess the effect of glucagon on fusion protein 
redistribution, the constructs were stably expressed in BHK/GR cells (Novo Nordisk, 
Bagsvaerd, Denmark) overexpressing the human GR. Untransfected BHK/GR cells were 
used as the control. Expression of GR was maintained with 500 jag G418/ml (Neo 
marker) and C-GFP LT was maintained with 500 (ag Zeocin/ml (Sh ble marker). CHO 
cells were also simultaneously co-transfected with vectors containing cDNAs for C- 
GFP LT and the human ARa2a (ATCC). Transfected cells are referred to as e.g. CHO/C- 
GFP LT cells in the text. 

For fluorescence microscopy, cells were allowed to adhere to Lab-Tek chambered 
coverglasses (Nalge Nunc Int., Naperville, IL, USA) for at least 24 hours and cultured to 
about 80% confluence. Prior to experiments, the cells were cultured over night without 
selection pressure in HAM's F12 medium with glutamax (Life Technologies), 100 \xg 
penicillin-streptomycin mixture ml" 1 and 0.3 % FBS. This medium has low 
autofluorescence enabling fluorescence microscopy of cells straight from the incubator. 

Immunoblotting 

Samples containing 10 jag of protein, determined according to the method of Bradford 36 
using the Bio-Rad Protein Assay (Bio-Rad Laboratories, Hercules, CA, USA), were 
added to SDS sample buffer" and run on precast 7.5 % SDS-PAGE gels with a 4 % 
stacking gel (Bio-Rad). The proteins were transferred to PH79 nitrocellulose 
membranes (Scleicher & Schuell GmbH., Dassel, Germany) for an hour at 4°C using a 
Bio-Rad Transfer Blot apparatus (80 V). Non-specific adhesion was blocked by 
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incubating the membranes over night in 3 % bovine serum albumin Fraction V (Sigma 
Chemical Company, St. Louis, MO, USA) in Tris-buffered saline (TBS) containing 50 
mM Tris pH 7.5 and 0.15 M NaCl and for an hour in 3 % skim milk powder (Difco 
Laboratories, Detroit, MI, USA) in TBS with 0.1 % Tween20 (TBST). The membranes 
were incubated for an hour in TBST with 3 % skim milk powder and the primary 
polyclonal rabbit anti-Ca antibody (Upstate Biotechnology Inc., Lake Placid, NY, 
USA), which was raised against a peptide corresponding to a 16 amino acid N-terminal 
stretch of human Cot, diluted 1:1000. After 4 washes of 5 min each with TBST, 
secondary antibody (horse radish peroxidase-conjugated donkey anti-rabbit 
immunoglobulin from Amersham International pic, Buckinghamshire, UK) diluted 
1:5000 in TBS with 3 % skim milk powder was added and incubated for an hour. After 
4 washes in TBST and one in TBS, immunoreactivity was detected by enhanced 
chemiluminescence (ECL) as described by the manufacturer (Amersham) and exposed 
on Biomax® MR film (Eastman Kodak Company, Rochester, NY, USA). All the steps 
were performed at room temperature unless otherwise stated. 

Time-lapse recording of fusion protein movement. 

Cells were cultured in HAM's F12 medium as described above. The chambers were 
placed on a temperature regulated microscope stage and kept at 37°C. Fluorescence 
images were captured using an Axiovert 135 inverted light microscope (Carl Zeiss, 
Oberkochen, Germany) equipped with a Fluar x40, NA 1.3 oil immersion objective 
(Zeiss) and a cooled (-40°C) CHI charged coupled device (CCD) camera (Photometries 
Ltd., Tucson, AZ, USA). The microscope was equipped with a 470±20 nm excitation 
filter, a 505 nm dichroic mirror and a 515±15 nm emission filter (Delta Lys & Optik, 
Lyngby, Denmark). The excitation light source was a 100W HBO arc lamp. 

Redistribution of the C-GFP LT fusion protein was quantified using an image analysis 
program custom written in LabVIEW (National Instruments, Austin, TX, USA). 
Fluorescent aggregates are segmented from each image using an automatically found 
threshold based on maximisation of the information measure between the object and the 
background. The a priori entropy of the image histogram is used as the information 
measure 37 . The area occupied by aggregates in each image is calculated by counting 
pixels in the segmented areas. The value thus obtained for each image in a series, or 
treatment pair, is normalised to the value found for the first (unstimulated) image 
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collected. A value of zero (0) indicates no redistribution of fluorescence from the 
starting condition. A value of one (1) by this method equals full redistribution. 

Spot photobleaching 

A Zeiss LSM 410 with x40 Fluar (as above) was used in spot scan mode at 488 nm to 
bleach individual fluorescent C-GFP L1 aggregates within CHO cells variously treated 
with forskolin. Fluorescence recovery at the locus of each aggregate was monitored 
immediately after bleach with successive small-area raster scans just large enough to 
include most of the cell in which the aggregate lay. Nominal output of the laser at 488 
nm, before launch into the microscope, was 10 mW. Subsequent raster scans were also 
run with the laser at full intensity and without a confocal aperture to allow the first to be 
made within 0.2 seconds of bleach, and for each scan to be completed within 0.3 
seconds (100 x 100 pixels per scan). The recovery of fluorescence for the majority of 
bleach experiments was measured over a period of 215 seconds, recorded in three 
consecutive blocks of 10 scans having successive intervals between frames of 0.5, 1 and 
5 seconds, and a final set of 15 scans each 10 seconds apart. A single scan collected 
prior to each bleach exposure served both to establish depth of bleach and to estimate 
maximum recoverable fluorescence in each experiment. Bleach recovery scans (8-bit 
images) were analysed using IPlab Spectrum software (Signal Analytics Corp., Vienna, 
VI, USA). A small region of interest (ROI) of between 6x6 to 10x10 pixels was used to 
define the area for which fluorescence recovery would be monitored in each experiment, 
and the average fluorescence within that ROI was measured for successive frames in 
each time series. The measurement ROIs were slightly larger than the bleached C-GFP LT 
aggregates to allow for cytoplasmic movements during the measurement period. The 
total average fluorescence within each frame was also measured to allow fluorescence 
recovery within C-GFP LT aggregates to be corrected for the minor effects of 
photobleaching caused by the series of measurement scans. 

Results of the spot-bleach experiments are presented as normalised values of 
displacement from photobleach, AF(t), versus time t: 

AF(t) - [F(oo) - F(t)]/[F(oo) - F(0)] 

where F(oo) = F.R/R f 
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F(oo) being the maximum recoverable fluorescence within a measurement ROI 
calculated from the pre-bleach intensity of the target aggregate, F,, corrected for total 
loss of fluorophores within the cell, R/R r , during the bleach exposure and recovery 
periods. 

SPA 

CHO/C-GFP LT cells were cultured in HAM's F12 medium as described above, but in 
96-well plates. The medium was exchanged with Ca 2+ -HEPES buffer containing 100 jiM 
IBMX. The cells were stimulated with different concentrations of forskolin for 10 min. 
Reactions were stopped with addition of NaOH to 0.14 M and the amount of cAMP 
produced was measured with the cAMP-SPA kit, RPA538 (Amersham) as described by 
the manufacturer. 

Automated imaging 

A Diaphot300 microscope (Nikon Corp., Tokyo, Japan) coupled to a camera based on 
the SITe back illuminated 512 x 512 CCD camera (Princeton Instruments Inc., Trenton, 
NJ, USA) and integrated with a digital data acquisition system using LabVIEW 
software was configured to allow automated focusing and image-based analyses in 96- 
well plates. CHO/C-GFP LT cells were cultured as described above but in 96-well plates 
and kept at 37°C throughout the experiments. A fluorescence micrograph of the same 
field of cells, initially chosen at random, was acquired before and 30 min after forskolin 
stimulation and analysed as described above. 

Endomembrane labelling with fluorescently tagged ceramides 

Golgi membranes in CHO/C-GFP LT cells were labelled with ceramide-FL (Molecular 
Probes Inc., Eugene, OR, USA) at 0.5 jaM for 20 minutes before washing. Ceramide-FL 
excited at 480 nm normally emits in the green at about 510 nm, but when concentrated 
(as in Golgi membranes) the fluorophore forms excimers, resulting in a shift in the 
emission maximum to greater than 600 nm 3 '. Images were collected at both 520 ± 10 nm 
and beyond 570 nm, allowing good separation of GFP LT and ceramide-FL signals. 
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Structure of the GFP^ T -bright aggregates 

Through-focus images of individual C-GFP U aggregates were collected from chilled 
cells with a x63 NA 1.4 oil-immersion objective. The built-in focus motor of the Zeiss 
LSM 410 was used to advance the objective 0.2 jam between images, 25 images per data 
set. Effective pixel size in the images was 65.6 nm. Data sets were corrected for 
bleaching and fluctuations in illumination intensity. Out-of-focus information in the 
images was removed using iterative, constrained, three-dimensional deconvolution 
(DeltaVision from Applied Precision Inc., Seattle, WA, USA) based on a theoretically 
calculated point-spread function. The deconvolved images were then reconstructed into 
a 3-D rotational projection of 40 images (9 degrees between images) using the method 
of maximum intensity ray-tracing (DeltaVision, Applied Precision, Inc., Seattle, USA). 
Two adjacent images in this set, re-sized and pixel-smoothed, were used to create the 
stereo pair shown in Figure 6C. An iso-surface rendering of the 3-D reconstruction was 
created using Milan software (BitPlane AG, Zurich, Switzerland) (Fig. 6B). 
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Figure legends 



Table 1. Time from initiation of a response to half maximal (t 1/2max ) and maximal (t max ) C- 
GFP LT redistribution. The data was extracted from curves such as shown in Figure 3B. 
All t,,„ and t values are eiven as mean±SD and are based on a total of 26-30 cells 

I/2max max 0 

from 2-3 independent experiments for each forskolin concentration. Since the observed 
redistribution is sustained over time, the t max values were taken as the earliest time point 
at which complete redistribution is reached. Note that the values do not relate to the 
degree of redistribution. 

Figure 1. Western blot analysis of lysates containing C-GFP LT fusion proteins. Total 
lysates of BHK/GR,C-GFP LT (A) and control BHK/GR (B) cells were probed with an 
anti-Ca antibody. 500 ng of purified bovine C subunit (C) was included as a positive 
control and to identify the endogenous C subunit. Although the antibody clearly reacts 
unspecifically with several proteins in the total lysates, the fusion protein (f) is 
recognised as a specific band, migrating with an apparent size of 60 kDa, in the 
transfected cells (A). The endogenous C subunit (e) migrated as predicted by its 
molecular weight of 41 kDa. It is possible to compare the expression levels of 
endogenous hamster C subunit and overexpressed mouse fusion proteins in these blots 
since the immunogenic peptide is conserved between these two species. 

Figure 2. Fluorescence micrographs of CHO cells expressing C subunit fusion proteins. 
The two fusion proteins of the C subunit of cAK show distinct localisation patterns. A. 
The NH,-terminal GFP LT -C fusion protein is localised almost evenly throughout the 
cytoplasm. B. The COOH-terminal C-GFP L1 fusion protein is highly concentrated in 
cytoplasmic aggregates, often in one large and several minor structures per cell. Scale bar 
10 urn. 

Figure 3. Time-lapse analyses of fluorescence redistribution in CHO/C-GFP lT cells 
treated with various agonists. The raw data of each experiment consisted of 60 
fluorescence micrographs acquired at regular intervals including several images acquired 
before the addition of agonist. Six of these images are shown (A) for the typical response 
to 1 \jM forskolin, taken at the time points indicated. The time point t=0 corresponds to 
the image acquired immediately before the cells were challenged with agonist. Scale bar 
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10 jam. The charts (B-G) each show a quantification of the responses in each time series. 
The total area of the highly fluorescent aggregates (see Experimental Protocol) is plotted 
versus time for each experiment. (B) Redistribution time profiles of the C-GFP L1 fusion 
following treatment of cells with various concentrations of forskolin. (C) Response 
following addition of 1 mM dbcAMP. (D) The effect of 100 |iM IBMX on the fusion 
protein distribution. (E) Demonstrates the reversibility of the forskolin-induced 
redistribution of C-GFP LT , where 10 |iM forskolin (open arrow) is followed shortly by 
repeated washings with buffer (dark arrow). In a parallel experiment, treatment with 10 
jaM forskolin plus 100 jaM IBMX is followed by repeated washing with buffer 
containing 100 jjM IBMX. (F) BHK7GR,C-GFP LT cells treated with 100 nM glucagon. 
(G) CHO/C-GFP LT cells transiently transfected with the ARa2a were pretreated with 10 
jaM forskolin (open arrow) to increase [cAMP], then given 10 \xM norepinephrine in the 
continued presence of forskolin. 

Figure 4. (A) Four frames from the recovery sequence following spot photobleach of a 
large aggregate (arrow) in a CHO/C-GFP L1 cell exposed to 25 |iM forskolin. Times are 
seconds after bleach. (B) Normalised displacement curves of the fluorescence recovery 
process in cells exposed to various levels of forskolin. Measurement points are 
averages±sem (n=4). (C) Linear fits to the first five points of the normalised recovery 
curves shown in (B). The slope of each line is used as an estimate of the half-time of 
recovery from bleach at each forskolin concentration. 

Figure 5. Parallel dose response analyses of forskolin effects in CHO/C-GFP LT cells on: 
[cAMP], elevation (□), the rate of recovery from spot photobleach (A) and induced 
change in C-GFP LT redistribution (•). [cAMP], was measured by SPA assay, analysing 
the effects of buffer or 8 increasing concentrations of forskolin in these cells. The graph 
shows a trace of the mean±sem expressed in arbitrary units (n=4 for each data point). 
Half times for recovery from spot photobleach were estimated from the first 5 time 
points of the mean value (n=4) curves in Figure 4B. Changes induced in C-GFP LT 
distribution were quantified as described (Experimental Protocol) using fluorescence 
micrographs taken of the same field of cells prior to and 30 min after the addition of 
forskolin. The graph shows a trace of the mean±sem at each forskolin concentration (n=8 
for each data point). The fitted curves indicate ^-maximal concentration values for 
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forskolin as: 1.7 jiM, image-based assay (□); 3.0 jiM, spot photobleach assay (A); 9.3 
HM, SPA (•). 

Figure 6. (A) Two images of CHO/C-GFP LT cells stained with ceramide-FL, in emission 
ranges of 520 ± 10 nm and >570 nm, have been superimposed to demonstrate the distinct 
separateness of Golgi membranes (orange) and C-GFP LT fluorescence (green). Scale bar 
is 10 jam. (B) An iso-surface rendering of a single large C-GFP LT aggregate (similar to 
that arrowed in 6A). The image is a reconstruction from 25 through-focus images 
deconvolved and processed as described (Experimental Protocol). Scale bar 1 ^im. (C) 
Stereo pair of the reconstructed images used to generate the iso-surface seen in (B). Each 
image is smoothed for presentation, the structure originally being 35 pixels high by 27 
wide in this orientation. Scale bar 1 jam. 
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Table 1 



[forskolin]/|iM 


tl/2max / S 


tmax / S 


1 


1 15±21 


310±31 


10 


69±14 


224±47 


50 


47±10 


125±28 
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